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ABSTRACT 

A discrete state, discrete time Markov-type manpower flow 
model having a two-dimensional state space (i.e., a two 
characteristic model) is analyzed. The probabilistic proper- 
ties of the model and the equations of stocks and flows are 
developed. A new method of representing the stocks as a sum 
of steady-state and transient components is presented. Two 
specific applications of the model to multi-grade hierarchical 
systems in which the dimensions of the state space are (grade, 
length of service) and (grade, time in grade) are analyzed in 
detail. The problem of combining states across the second 
dimension of the state space is studied and methods are 
derived for the (usual) case where the states are not lumpable. 
Finally, some applications of the model with state space 
(grade, skill group) to retraining problems between various 
skill categories are presented. 
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I . INTRODUCTION 



This paper deals with manpower flow models, specifically 
discrete state, discrete time Markov-type models in which 
the states are described by two characteristics. This type 
of manpower flow model will be referred to as a "two- 
characteristic model" to distinguish it from the more common 
model in which the states are described by one characteristic. 

Markov- type flow models have been used for some years; 
see for example Young and Almond (1961) , Blumen, Kogan and 
McCarthy (1955) , Marshall and Oliver (1970) and Rowland and 
Sovereign (1969) . Theoretical properties of the model are 
developed in Bartholomew (1967) . Charnes, Cooper and Niehaus 
(1972) use the Markov-type flow model as part of a larger 
manpower planning model. 

The latter application is typical in that the Markov-type 
flow model is not used in isolation to project stocks and 
flows of people, but rather it is used to provide information 
to a more extensive model that deals with budgets, capital 
investments and aggregate work planning. The simple mathe- 
matical structure and the computational tractability of 
Markov-type models make it practical to embed them in larger 
models. 

Early Markov-type models had one-characteristic states, 
such as pay grade or status in a hierarchical system. 

Geometric lifetimes in these states were thus implied by the 
model structure and often could not be justified. However, 
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the one-characteristic model usually led to a small state 
space which implied little computational difficulty. The 
simple mathematical structure of these models was also 
appealing, and analytical results were obtainable. 

In an attempt to overcome the shortcomings of these 
simple models subsequent models in manpower planning used 
more than one characteristic to describe the states of the 
system. See for example the U.S. Navy ADSTAP models (U.S. 
Navy, 1973) and the U.S. Air Force TOPLINE model (RAND, 1973). 
In the documentation we have seen these have not been formally 
recognized as Markov-type models. Each model has been treated 
as an individual case. By viewing these models as special 
cases of a more general two-characteristic Markov-type model, 
we present in this thesis a unified treatment of such a model. 
We are able to develop the theory of its structure and we use 
this to analyze special cases frequently applied in military 
manpower planning. Some new applications of the model to the 
problem of retraining between skill groups in an organization 
are presented. 

A. GENERAL DESCRIPTION 

A graded manpower system is one in which each person is 
assigned a "grade," e.g., a pay grade, a rank or a status 
label. Such a system may be analyzed using a Markov-type 
model in which the states are defined to be the grades. 

Thus a state is identified by a single characteristic and we 
say the model has a one-dimensional state space. The analysis 
of a graded manpower system by such models is often suspect 
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because the grades do not provide a state space within which 
the system is actually Markovian. For example those in a 
given grade who progress rapidly through the lower grades may 
have significantly different promotion prospects from those 
in the same grade whose progress was less rapid. Thus the 
future progress of an individual in a state depends on how 
he reached the given state. 

As is frequently the case in Markov models, expanding 
the state space may lead to a model of the system in which 
the Markov assumption is more realistic. In this paper the 
state space of a Markov-type model of a graded manpower sys- 
tem is expanded to one in which each state is identified by 
a couple (i,j) where i represents a grade and j represents 
a second characteristic. Examples of a second characteristic 
are: 

1) Length of service in the system (LOS) 

2) Time in grade (TIG) 

3) Skill category 

4) Overall performance mark 

5) Physical location. 

In cases 1) and 2) the second characteristic is chosen 
primarily to make the Markov assumption more tenable. In the 
other cases the appropriate second characteristic is chosen 
to fit the problem at hand. 

Expanding the state space of a Markov-type model presents 
some difficulties, among which are: 

1) There are more parameters (transition probabilities) 
in the model to be estimated or controlled. 
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This paper does not deal with the problems of parameter 
estimation. In practice the estimation of transition proba- 
bilities in manpower systems is hindered more by lack of 
data than by questions regarding appropriate statistical 
techniques. More pertinent is the number of parameters to 
be controlled, because the models analyzed here are intended 
for planning rather than forecasting. That is, we are inter- 
ested in comparing the results of alternative policies rather 
than projecting the consequences of some "state of nature." 
Increasing the number of parameters to be controlled in a 
planning model can lead to practical difficulties. Such 
difficulties are not investigated in a comprehensive fashion 
in this paper; however, in Section D of Chapter IV and 
Sections C and D of Chapter VII we demonstrate methods which 
effectively reduce the number of parameters the planner must 
control. 

2) Computations may be less tractable, if not completely 
impractical. 

The manpower flow model, is typically a component of a larger 
planning model, so it is to be expected that a single run of 
the planning model will require numerous calls for information 
from the flow model. Consider a system in which two grades i 
and k are important plateaus to be reached which affect an 
individual's retirement benefits. A retirement policy planning 
model might require information on the fraction of people who 
will attain grade k given that grade i is attained as well as 
the distribution of time required to reach grade k. Practical 
methods for deriving this type of information from a 
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two-characteristic model are developed in Chapters II and III 
Although a two-characteristic model in a military manpower 
application would have at least one hundred states, it is 
shown that much information of interest can be derived by 
calculations restricted to a smaller number of states. 

3) Analytical results which yield insight into the 
properties of the system may be more difficult to 
obtain. 

Certainly analytical results are more difficult to obtain in 
a two-characteristic model than in a one-characteristic model 
Nevertheless, we obtain a number of results which include the 
structure of the fundamental matrix (Chapter II) , the equi- 
librium distribution of people in the system under various 
hiring policies (Chapter III) , the effect of average time 
spent in a grade on the equilibrium distribution of people in 
that grade (Chapter VI) and the relation between promotion 
rates and the number in a grade (Chapter VII) . 

B. CONTENTS AND SUMMARY 

Chapter II begins with the definition of basic notation 
and the statement of assumptions. Probabilistic properties 
of the transient matrix of the two-characteristic model, such 
as the expected number of visits to a state and the expected 
time spent in a grade are developed. A not uncommon property 
of manpower flow models, the 0-1 visiting property" (i.e., 
each state can be visited no more than once) , is defined and 
its implications discussed. The "no return property" (i.e., 
a transition into a state is impossible once a transition has 
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been made out of it) is defined. If the states in a grade 
have the no return property then it may be assumed without 
loss of generality that the fundamental matrix for that 
grade is upper triangular and thus is relatively easy to 
compute. Matrices of the probabilities of going from one 
state to another in t steps (t-step transition matrices) are 
defined and a recursive formula for their calculation is 
shown. The t-step transition matrices are subsequently used 
in Chapter III to represent the stocks as the sum of steady- 
state and transient components. In the last section of 

N 

Chapter II is it shown how the basic probabilistic properties 
of the transient matrix of the two-characteristic model are 
developed when one conditions on the attainment of a higher 
grade. The results of Chapter II follow closely the results 
of Kemeny and Snell (1960) , but extend them to the two- 
characteristic model. Thus Chapter II contains a compre- 
hensive treatment of how probabilistic information may be 
obtained from a two-characteristic model, and such a treat- 
ment has not been previously published. 

Chapter III begins with a definition of the timing 
convention. In real systems people enter, leave and change 
grades or skills continuously. In order to fit a Markov- 
type model in discrete time the notion of a "period" must 
be carefully defined and a consistent counting convention 
used. Since planners almost invariably use discrete planning 
periods, these conventions lead to useable results. We 
assume time is divided into discrete periods (for example, 
years) . Each period is denoted by the integer value of time 
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at the end of the period. People are counted only at the 
last instant of a period. Various stock and flow vectors 
are defined; the term stock refers to the number of people 
in a state or set of states at the end of a period, the 
term flow refers to the number of people who make transitions 
from one state in one period to another state in the next 
period. In Section B we develop equations for computing the 
expected stocks at time t from the values of stocks and flows 
prior to time t. Section C contains the development of a new 
method for expressing the stock vector as a sum of a "steady- 
state" component and a "transient" component. The steady- 
state component of the stock vector is derived for the cases 
where external flows are 

1) Constant 

2) Growing linearly, 

3) Growing geometrically. 

Chapter IV deals with the two-characteristic model where 
length of service (LOS) is the second characteristic. This 
model is called the (grade, LOS) model. The chapter begins 
with background material showing how practical considerations 
might lead the manpower planner to use the (grade, LOS) model. 
In Section B we present definitions and display some of the 
matrices associated with the (grade, LOS) model in order to 
show their rather simple structure. The concept of a cohort 
(a group of people who enter the system at the same time and 
in the same state) is discussed. It is noted that when entry 
to the system is restricted to a single state, the (grade, 

LOS) model preserves cohorts in the sense that members of 
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different cohorts never simultaneously occupy the same state. 
Section C discusses computationally efficient methods for 
performing the matrix multiplications needed to derive 
information from the (grade, LOS) model. The discussion in 
this section depends on Appendix B which treats the multi- 
plication of "diagonal matrices." Section D shows how the 
structure of the (grade, LOS) model might be exploited in 
solving a problem regarding the minimization of the costs of 
reenlistment (in a military organization) . In the last 
section of Chapter III, Section E, we discuss how the 
modeler may treat grade as the second characteristic and LOS 
as the first characteristic. Such an interchanged model is 
called the (LOS, grade) model. It is shown that the (LOS, 
grade) model is more flexible than the (grade, LOS) model. 

Chapter V deals with the two-characteristic model where 

time in grade (TIG) is the second characteristic. This model 

is called the (grade, TIG) model. The chapter begins with a 

brief discussion of conditions under which the model might be 

appropriate. In Section B the definitions are stated and the 

v 

structure of various matrices associated with the (grade, 

TIG) model is displayed and discussed. It is noted that the 
(grade, TIG) model is equivalent to a discrete semi-Markov 
process. In Section C we discuss how the modeler may treat 
grade as the second characteristic and .TIG as the first 
characteristic. The resulting model, the (TIG, grade) model, 
is different in structure from the two-characteristic models 
previously studied. However it is briefly shown how one 
might derive information from such a model. 
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Chapter VI treats the problem of combining states in a 
two-characteristic model. The problem is of practical 
importance because often there is more than one reasonable 
choice of the second characteristic to consider, and these 
would lead to higher dimensional state spaces. For example, 
length of service, time in grade and skill group are three 
reasonable choices of the second characteristic in military 
manpower planning models. Only one of these can be chosen, 
so it is useful to have some guidance regarding how one might 
combine states across the characteristics that are not chosen 
for inclusion in the two-characteristic model. In Section A 
we investigate the condition under which the states in each 
grade of a two-characteristic model are "lumpable" as defined 
by Kemeny and Snell (1S60) . In Section B we discuss how one 
might combine states when the conditions for lumpability are 
not satisfied. 

In Chapter VII the problem of retraining people between 
various skill groups is considered. Accordingly the second 
characteristic of our model is taken to be the individual's 
skill group. The retraining problem is formulated under a 
condition of equilibrium stocks and flows; attrition rates 
must be specified. The decision variables for the planner 
are stocks, promotion rates and total retraining costs. Of 
course these variables are interdependent. We treat stocks 
as the independent variables and show the relation of promotion 
rates to stocks when external flows are specified. Two methods 
for varying the stocks to achieve desired promotion rates are 
developed. Equations to compute the availabilities and 
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requirements for retrained people are shown. The total cost 
of retraining is modeled using the classical transportation 
problem. The foregoing availabilities and requirements are 
treated as the supplies and demands in the problem; the unit 
costs of retraining people are treated as the transportation 
costs. The relation of the total cost of retraining to the 
stocks (and promotion rates) is developed. We assume that 
the planner wants to minimize the total cost of retraining, 
however we do not assume that he can explicitly describe the 
constraints on stocks and promotion rates that must be 
observed in minimizing costs. We develop a technique in 
which the planner can vary the stocks in two successively 
indexed grades so as to reduce total retraining costs, while 
holding the stocks and promotion rates in all other grades 
unchanged. This technique is quite practical because the 
planner needs only to be able to recognize combinations of 
stocks and promotion rates that are acceptable rather than 
having to formally specify a set of constraints. The practi- 
cality of this technique is further enhanced by our demonstra- 
tion that when hiring is restricted to the lowest pay grade 
the total retraining cost is a convex function of the numbers 
of people hired and promoted each period. Thus if the 
collection of acceptable combinations of stocks and promotion 
rates is a convex set any locally optimal solution to mini- 
mizing total retraining costs is globally optimal. 
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II. PROBABILISTIC PROPERTIES OF THE TWO-CHARACTERISTIC MODEL 

In this chapter we develop various probabilistic proper- 
ties of the two-characteristic model. One of our purposes 
here is to present a unified, computationally tractable 
approach to deriving probabilistic information from a two- 
characteristic model. 

Let Q be the transition matrix for the transient states 
of a two-characteristic model. (The reader should see 
Chapter 3 of Kemeny and Snell (1960) for a general treatment 
of transient states in finite Markov chains.) For a typical 
military manpower planning application Q has dimensions at 
least 100 x 100. If the model were applied to the U.S. Navy 
enlisted force 'with nine pay grades as the first characteristic 
and 88 skill groups as the second characteristic, then Q would 
have dimensions 792 x 792. Military applications with even 
larger state spaces are not uncommon. 

Following Kemeny and Snell (1960) we define the "funda- 
mental matrix," denoted N, to be (I-Q) Many of the proba- 

bilistic properties of interest can be obtained from a 
knowledge of N. With large state spaces the computation of N 
is non-trivial. By imposing reasonable restrictions on the 
transitions between grades (no demotions and no multi-grade 
promotions) we are able to define a fundamental matrix for 
each grade i, determine many probabilistic properties of the 
overall system from these comparatively small matrices, and 
show how these matrices are combined to form the fundamental 
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matrix N. In military manpower planning applications, the 
number of states in any grade is typically one fifth to one 
tenth of the total number of states in the system, so 
computing the fundamental matrix for a grade is signifi- 
cantly easier than computing the fundamental matrix for the 
entire system. Usually only limited portions of the fundamental 
matrix for the whole system are needed, and these may be 
readily calculated using the techniques developed here based 
on the fundamental matrices of the individual grades. 

Certain matrix and vector notation used in this and 
subsequent chapters is summarized in Appendix A. 

A. DEFINITIONS AND ASSUMPTIONS 

Each state in the system is identified by a couple (i,j); 
a person is in state (i,j) if he is in grade i and has second 
characteristic j. There are n consecutively numbered grades: 
i=l,***,n. Each grade i has a set of values of the second 
characteristic, denoted J(i), so the state (i,j) is defined 
for jeJ(i) but not otherwise. For example if the second 
characteristic is length of service (LOS) , and a person in 
grade 1 can be in this grade only when his LOS is between 1 
and 3, then J(l) = {1,2,3}. 

We assume that the second characteristic takes on consec- 
utive integer values in each grade, so we define, 

l(i) = smallest value of the second characteristic 
associated with grade i, 

u(i) = largest value of the second characteristic 
associated with grade i. 
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Thus, 

J(i) = { j : j = 1 (i ) , ,u(i) }. 

Let w^ be the number of elements in J(i). Then, 

w i = u(i) - 1 (i) + 1. 

Denote the set of states associated with grade i by T^ . Then, 
T^ = { ( i , j ) : jeJ(i)}, i=l, • • • ,n. 

Note that T^ contains w^ elements, 
n 

Let T = U T., the complete set of all transient states, 
i=l 1 

and let T^ denote the single absorbing state "out of the 

system. " 

Let, 

1) be a x w^ matrix of one-step transition proba- 
bilities <3£(j,m), j,meJ(i), where q^(j,m) is the 
probability of a transition from state (i,j)eT^ to 
state (i ,m) eT^. 

2) be a w^ x w^ + ^ matrix of one-step transition proba- 
bilities p^(j,m), jeJ(i), meJ(i+l), where p^(j,m) is 
the probability of a transition from state (i,j)eT^ 

to state (i+1 ,m) eT^ + ^. 

3) be a w^ x 1 matrix of one-step transition proba- 
bilities a^(j), jeJ(i), where a^(j) is the probability 
of a transition from state (i,j) to "out of the system," 
i . e . , to Tq . 
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The basic assumption in this thesis is 

AO: Movement between states of the system follows the 

stochastic laws of a stationary finite state Markov 
chain . 

The following restrictions on movements between grades are 
assumed: 

Al: From any state in T^ it is possible to make a one- 
step transition only to states in T^, T^ + ^ or Tq. 

The practical significance of this assumption is that no 
person is promoted more than one grade in a single period, 
and no one is ever demoted. We have chosen for the sake of 
definiteness to call the first characteristic "grade." Any 
characteristic that satisfies Al can be used as a first 
characteristic. Length of service is an example of a 
character i. stic that satisfies Al, and we take advantage of 
this in Section E of Chapter IV. 

We also assume: 

A2: Each matrix Q^, i=l,***,n, is a transient matrix. The 

The practical significance of this assumption is that no one 
can stay in a grade forever. With Al this means that everyone 
entering the system must (with probability 1) eventually 
leave the system. 

Under AO through A2 the one step stochastic transition 
matrix for the entire system, denoted P, is 
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Q 1 P 1 



p = 



Q 2 P 2 



I A 



Q 3 \ | A 3 

\ \ 

\ P n-1 I A n-1 

I A n 



I 1 



(Recall from Appendix A that 0 is a vector of zeroes.) 

The transient matrix Q for the transient states T is then, 



Q = 



Ql P x 



Q 2 P 2 



°3 X \ 

V- 

\ 

Q n 



(1) 



The plan of this chapter is to develop the probabilistic 
properties of: 

1) any set of states T^, 

2) any union of consecutively indexed sets T^, 

m 

i.e. , (J T. , 
i=k 



3) the union of all transient states, T. 

One of the purposes of this development is to show that the 
stochastic properties of Q, typically a large matrix, are 
readily calculated in terms of the smaller matrices and P^. 
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The results of this chapter follow closely those in 
Chapter 3 in Kemeny and Snell (1960) . The notation 
(K&S,3._._) indicates that a result can be derived from 
theorem 3. .in Kemeny and Snell. 



B. FIRST-ORDER PROPERTIES 

The term "first-order properties" is used here as a general 
term for various probabilities and first moments, e.g., the 
probability of visiting a state, the expected number of visits 
to a state and the expected time spent in a grade. 

Under A2 the overall transient matrix Q has a fundamental 
matrix N = (I-Q) ^ , and each element of N is the expected 
number of visits to the column state starting from the row 
state (K&S,3.2.4). Under Al, Q has the structure shown in 
(1) . By direct methods it can be shown that 



N. = 



N 1 N 1 P 1 N 2 N 1 P 1 N 2 P 2 N 3 



N 2 N 2 P 2 N 3 



N. 



n-1 

n (n.p.)n 
i=l 1 1 n 



n-1 

. n (n.p.)n 

. n i i n 
1=2 



n-1 

n (n.p.)n 

. 0 i i n 

1=3 i 

I 



( 2 ) 



I 

N 



n 



where N^=(I-Q^) -1 , i=l,***,n, the fundamental matrix for grade 
i. Note that the large matrix N is completely determined by 
the matrices and P^. Thus the only matrix inversions 
required to find (I-Q) 1 are the inversions of (I-Q^) , 
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i=l,***,n. This is of considerable computational significance 
because as previously mentioned, Q is usually a large matrix. 

Each matrix has a probabilistic interpretation. 

We pursue this interpretation and show that these matrices 
can be used to determine other probabilistic properties of 
interest. 

In this and subsequent sections we make numerous defini- 
t h 

tions, the k definition is denoted by Dk. 

Let us consider first the properties associated with a 
single set of states and define: 

Dl. v^(j,m) = expected number of visits to state (i,m) 

given that grade i is entered in state (i,j) 
D2. = a w^ x w^ matrix having v^(j,m) as the element 

in row j-l(i)+l and column m-l(i)+l. 

From (2) the element of N^ in row j-l(i)+l and column 
m-l(i)+l equals the expected number of visits to state (i,m) 
given that grade i is entered in state (i,j). (K&S,3.2.4). 

So, from definitions Dl and D2, we have, 

V £ = N i . (3) 

Note that the rows and columns of N^ and correspond to 
states in grade i in the same way as the rows and columns 
of Q i - 

Now define: 

D3. T^(j) = expected time in grade i given that grade i 

is entered in state (i,j) 

D4. T i = [t ± (1 (i) ) , • • • , r i (u(i))], a w_ L x 1 vector. 

The expected time spent in grade i equals the sum of the 
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expected number of visits to the various states in grade i. 
From (3) and D3 , 

^^(j) = component (j-l(i)+l) of N^T, 
and from D4 , 

if\ = N^I, a w^ x 1 vector (4) 

(Recall from Appendix A that I is a vector with all compo- 
nents equal to one.) 

We next turn our attention to where the process goes 
when it leaves grade i. From assumption A1 (no demotions, 
no multi-grade promotions) the process upon leaving must 
enter either or T^. Next define: 

D5. b^(j,m) = probability of entering grade i+1 in state 

(i+l,m) given that grade i is entered in 
state (i,j) 

D6. = a w^ x w^ + ^ matrix having b^(j,m) as the element 

in row j-l(i)+l and column m-l(i+l)+l 
D7. b^(j) = probability of ever entering given that 

grade i is entered in state (i,j) 

D8 . b i = (b i (1 (i) ) , • • • ,b i (u (i) ) ] , a w i x 1 vector 
D9. b^Q(j) = probability of never entering given 

that grade i is entered in state (i,j) 

DIO. b iQ = [b iQ (1 (i) ) , * “ ,b iQ (u (i) ) ] , a w i x 1 vector. 

From these definitions it follows that _ 

= N^P^, a w^'X w^ matrix (K&S,3.5.4), (5) 

b^ = B^I, a w^ x 1 vector, 

and. 
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b. 



1 



= N.A., a w. x 1 vector. 

The matrix is particularly useful in our analyses. 

For example, let f^ be a 1 x w^ vector of the number of 

people entering T^. Then f^B^ is a 1 x w^ + ^ vector of the 

number of these people who will eventually enter T^ + ^. 
(K&S,3.3.6). This vector is used repeatedly in Chapter III. 

Next we consider the first-order properties related to 
grades i and k where i^k. Define: 

Dll. b ( (i , j ) , (k,m) ) = probability of entering grade k in 

state (k,m) given that grade i is 
entered in state (i,j) 

D12. B^ = a w^ x Wj, matrix having b ( (i , j ) , (k,m) ) as 

the element in row j-l(i)+l and column m-l(k)+l. 
From definitions D5 and Dll and a simple conditioning argument 
we have 



u ( i ) 

b( (i, j ) , (i+2 ,m) ) = £ ( j ,r) b i+1 (r ,m) . 

r=l (i) 

So, from D12, 



B. ‘.o — B . B . , 0 . 

1,1+2 i 1+2 

Notice from Dll that B^ is an identity matrix and from D5 

that B- . ,=B.. More generally it can be shown that for i<k, 
1 / 1 » JL 1 



k-1 

B.,= > B , a w. x w, matrix, 

lk L . r i k 

r=i 



Define: 

D13. v ( (i, j ) , (k,m) ) = expected number of visits to state 
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(k,m) given that grade i is entered 
in state (i , j ) . 

D14. V ik = a x w k matrix having v ( (i, j ) , (k,m) ) as the 
element in row j-l(i)+l 

D15. b^ k (j) = Probability of ever entering grade k, given 

that grade i is entered in state (i,j) 

D16 . b ik = [b ik (1 (i) ) , • • • ,b ik (u (i) ) ] , a w ± x 1 vector. 
Considering each row of B^ k as the part of an initial proba- 
bility vector that applies to T k , we then have, 



V^ k = B^ k N k , a w^ x w k matrix (K&S,3.5.4), 



( 6 ) 



and , 



b^ k = B^ k l, a w^ x 1 vector. 

Define : 

D17. x^ k (j ) = expected time in grade k given that grade i is 

entered in state (i,j) 

D18. x ik = [r ik (1 (i) ) , • • • , T ik (u(i))], a w i x 1 vector. 

The expected time in a grade is the sum of the expected number 
of visits to states in that grade, so 
T^ k = V ik l, a w^ x 1 vector. 

This completes our study of the first-order properties 
related to the various grades of the system. The foregoing 
definitions by no means exhaust the first-order properties of 
the two-characteristic model that might conceivably be of 
interest. It is felt, however, that these properties will often 
be of practical interest and that other first-order properties 
may be readily derived from these. 
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C. TWO SPECIAL CASES 

The elements of the fundamental matrix for grade i, N^, 
have a somewhat different interpretation when the states in 
grade i have what we call the "0-1 visiting property." We 
say that a state has the 0-1 visiting property if the state 
can be visited no more than one time. Important examples of 
two-characteristic models in which all transient states have 
the 0-1 visiting property are the models in which the second 
characteristic is either length of service (see Chapter IV) or 
time in grade (see Chapter V) . 

If each state in T^ has the 0-1 visiting property, then 
the expected number of visits to a state in r J\ is equal to the 
probability of visiting the state. The element of in row 
j-l(i)+l and column m-l(i)+l may then be interpreted as the 
probability of visiting state (i,m) given that grade i is 
entered in state (i,j). 

Another property of interest is the "no return property." 
we say that a state has the no return property if it 
is impossible to ever make a transition into the state after 
a transition has been made out of the state. The 0-1 visiting 
property implies the no return property, but they are not 
equivalent. For example, in modeling manpower flows in the 
U‘. S. Civil Service one might use "pay step" as a second 
characteristic. Each state is then a couple (grade, pay step) . 
A person can stay in the same pay step for more than one 
period, so if there are no demotions then each state would 
have the no return property but not the 0-1 visiting property. 
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If the states in TV have the no return property then it 
is possible to order the states in T\ so that is upper 
triangular. When is upper triangular so is I-Q^ and the 

i 

computation of the inverse of I-CK, i.e., the fundamental 
matrix for grade i, , is considerably easier than in the 
general case. 

If the states in T\ have the 0-1 visiting property, then 
not only is upper triangular but also the elements of 
on the main diagonal are all ones. 



D. VARIANCES 

t 

The format in this section follows closely that of Section 
B, but here we are concerned with various second moment 
properties of the two-characteristic model. 

Define: 

D19. v 0 . (j,m) = variance of the number of visits to 

2,1 

state (i,m) given that grade i is entered 
in state (i,j) 

D20. V„ . = a w. x w. matrix having v 0 . (j,m) as the element 
in row j-l(i)+l and column m-l(i)+l. 



Following (K&S ,3.3.3) , 

V 2,i “ V 2(N i>ag - - < N i>sg • 

(See Appendix A definitions of "dg" and "sq." ) 

Define : 

D21. t _ . (j) = variance of the time spent in grade i given 

2 , 1 . 

that grade i is entered in state (i,j) 

D22. t_ . = [T_ . (1 (i) ) , • • • ,r . (u ( i ) ) ] , aw. x 1 vector. 

2 , 1 . 2 , 1 . 2 , 1 . -L 
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Following (K&S,3.3.5), 



T 2,i - (2N i ' I)T i - < T i>sq- 
Define : 

D23. v 2 ( (i, j) , (k,m) ) = variance of the number of visits to 

state (k,m) given that grade i is 
entered in state (i,j) 

D24 . V 2 (i,k) = a x w k matrix having v 2 ( (i , j ) , (k,m) ) as 

the element in row j-l(i)+l and column 
m-1 (k) +1. 

Following (K&S,3.3.6), 

V 2 (i,k) = V ik (2(N k ) dg - I) - (V. k ) sq . 

Define : 

D26. x 2 ((i,j),k) = variance of time spent in grade k given 

that grade i is entered in state (i,j) 
D27. t 2 ( i / k ) = [T 2 ((i,l(i)),k),-*-, T 2 ((i f u(i)),k)] . 

a w^ x 1 vector. 

Following (K&S,3.3.6), 

TjU.k) = B ik (2N k - I) - (T. k ) sq . 

If each state in has the 0-1 visiting property, then the 
diagonal elements of are equal to one, and, 

<N i>ag - 

v 2,i = N i - < N i>sq ' 
v 2 (i,k) = v ik - (V ik ) sq . 



E. MATRICES OF t-STEP TRANSITION PROBABILITIES 

In this section we consider the probability of being in 
state (k,m) t steps after being in state (i,j). The 
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matrices of these probabilities are called the t-step transi- 
tion matrices. They are used in Chapter III to represent the 
stock vectors as a sum of steady-state and transient components. 
In Chapter IV the t-step transition matrices are shown to 
provide a description of how "cohorts" flow through the system. 
Define : 

D28. m (t : (i , j ) , (k,m) ) = probability of being in state (k,m) 

t steps after being in state (i,j), 
t = 0,1,2, •• • 

D29. M^(t) = a w^ x Wj, matrix having m (t : (i , j ) , (k,m) ) 

as the element in row j-l(i)+l and 
column m-l(k)+l. 

The rows of M., (t) are associated with states in T.: the 
lk 

columns of M^(t) are associated with states in T^. 

We have immediately that 

M . . ( 0 ) = I . 
li 

From assumption Al (no demotions, no multi-grade promotions) 
we have, 

M ik ( t) = 0 if i > k, 

M ik (t) = 0 if t < k-i . 

(Recall from Appendix A that 6 denotes a matrix of zeroes.) 

If the process is to be in state (k,m) exactly t steps 
after being in state (i,j), then it must be in some state in 
grades k or k-1 exactly t-1 steps after being in state (i,j). 
Conditioning on this fact leads to the recursive equation, 



M ik (t) = M ik (t " 1)Q i + M i,k-l (t " 1)P i-l 



t = 1,2, 



(7) 
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For any i and k the sum over t of the probability 
matrices M^(t) gives the matrix of the expected number of 
visits to states in grade k starting from states in grade i. 
So we have. 



oo 



I M 
t=0 



ik 



(t) 



V ik' i± k ' 



0 , otherwise 



(8) 



By assumption A2 the matrices are transient, so is a 

matrix of finite elements. This implies that. 



lim M (t) = 0. 

t-»-oo 1K 



(9) 



From (7) it can be shown by an inductive argument that 

M ik (t » ■T»i,k-i (t - 1 - rlp k -X' (10 > 



The t-step transition matrices provide a rather compre- 
hensive picture of how people move through a two-characteristic 
system. 



F. CONDITIONING ON PROMOTION 

In manpower planning one is often interested in conditional 
probabilities, e.g., the probability of attaining grade k 
given that grade i is attained. The stochastic properties 
of the transient matrix Q under conditioning on promotion are 
briefly developed in this section. 

Define: 

D30. (i,j:t) = the event "in state (i,j) at time t" 

D31. T* = the event "a transition is made into T^ before 
leaving the system.” 
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Conditioning on the event t£ is the same as conditioning on 
promotion to grade k. 

Define : 

D32 . q i (j / m) = Pr [ (i f m; t+1 | (i , j ; t ) ] 

D33. q*(j,m) = Pr [ (i ,m; t+1) | (i , j ; t ) , T* +1 ] 

Provided that Pr[T^ + ^| (i,j;t)] ^ 0, we have 
q£(j,m) = Pr [ (i,m; t+1) | (i, j ; t) , T* +1 ] 

= Pr [ (i,m;t+l) | (i, j; t) ] 

Pr t T ^ + i I (i / j ? t ) / (i ,m; t+1) ] 

x 

Pr [T i+1 | (i, j ; t) ] 



= q^jfin) 



= q^j,™) 



Pr [T? +1 1 (i,m; t+1) ] 

Pr [T* +1 | ( i , j ; t ) ] 

b i (m) 

b ± (j) 



Define : 



(ID 



D34. C. = a w. x w. matrix having the elements of b. 

(see D8) on its main diagonal and zeroes 
elsewhere. 

We will assume that promotion to grade i+1 is possible 
from every state in . Under this assumption ^ exists. 

If promotion to grade i+1 is impossible from some state 
(i,j) then we must avoid conditioning on an impossible event. 
This is readily accomplished by temporarily treating state 
(i,j) as part of Tq (out of the system) and redefining j(i), 
C>i' P^ and accordingly. 
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Define : 



* * 

D35. = a w^ x w^ matrix having q^(j,m) as the element 

in row j-l(i)+l and column m-l(i)+l. 

Then from (11) and D34, 




"3c 

The matrix is the matrix of within grade one-step transition 
probabilities conditioned on the attainment of grade i+1. 

Define : 



D36 . 


P i (j#m) = 


Pr [ (i+1 ,m; t+1) | (i, j ; t) ] 


D37. 


P*(j/m) = 


Pr [ ( i+1, m; t+1) | (i, j ; t) , T* +1 ] 


D38 . 


P* = a w. 
1 1 


x w. +1 matrix having P^(j,m) as the 



element in row j-l(i)+l and column m-l(i+l)+l. 

We then have, 

P?(j,m) = Pr [ (i+1 ,m; t+1) | (i , j ; t) , T? +1 ] 

= Pr [ (i+l,m; t+1) | (i, j;t) ] 

Pr ^i+i I (i'3/ 1) , (i+1 ,m; t+1) ] 

x 

Pr [T*+il (i , j ; t) ] 

= Pi(j,m) X . 

Thus from D34 and D38, 




The matrix P* is the matrix of one-step promotion probabili- 
ties conditioned on the attainment of grade i+1. 

Because (Q*) r = C^Q^C^, 

The fundamental matrix for grade i when we condition on 
promotion to grade i+1 is 
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* * -1 

N. = (I-Q . ) 

l l 



= ’ I (Q*) r 

r=0 1 



= IcfVc, 

r=0 

= C . -1 N . C . . 
l ii 

Define : 

D39. v*(j,m) = expected number of visits tc state (i,m) 

JL 

given that grade i is entered in state 

(i,j) and grade i+1 is attained. 

D40. V. = a w. x w. matrix having v. (j,m) as the element 
i i l 3 i J 

in row j-l(i)+l and column m-l(i)+l 



D41. 



D42, 



t>^(j,m) = probability of entering grade (i+1) in 
state (i+l,m) given that grade i is 
entered in state (i,j) and grade i+1 is 
attained. 

B? = a w^ x w^ + ^ matrix having b*(j,m) as the 

element in row j-l(i)+l and column m-l(i+l)+l. 



Then one may show that, 

* * 

V i = N i ' 



and, 




Note that B* is simply B^ with its rows normalized; Q* is not 
simply a row normalized form of Q^. 
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As with the matrices B^, products of matrices B? with 
successive indices are well defined; their meaning is that 
of a matrix B^ as defined in Dll and D12 with conditioning 
on attainment of grade k. 

The conditioned and unconditioned matrices may be used 
together. For example, the elements of B^ B^ + ^ give the 
probabilities of entering grade i+2 in the column state 
conditioned on starting from the row state in and even- 
tually attaining grade i+1. 
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III. EQUATIONS OF STOCKS AND FLOWS 



We begin by defining the terms "stocks" and "flows" and 
then discuss why stocks and flows are important in manpower 
planning models. Next the relations between stocks and 
flows in a two-characteristic model are developed. Finally 
we show how the stocks can be represented as the sum of a 
"steady-state" component and a "transient" component. 

A. DEFINITIONS AND BACKGROUND 

The following three definitions constitute a "timing 
convention"; it is used in this and following chapters. 

(1) A period is the interval of time from immediately after 
an integer value of the time parameter t up to and including 
the next integer value of t. A period is identified by the 
value of the time parameter at the end of the period. Thus, 

period t^ = {t: t^ -1 < t <_ t^} 

where t^ is an integer. 

(2) The number of people in a state at the end of a period 
is referred to as the "stock" in that state. Thus, stocks are 
counted only at integer values of the time parameter, t. 

(3) The number of people who change their status in the 
system from one state to another during any period is referred 
to as a "flow." Flows occur during a period, but we do not 
specify the exact time at which they occur. 

Stocks and flows are of primary importance in most man- 
power planning models. The most obvious reason for this is 
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that costs are closely related to stocks and flows, e.g., 
total payroll depends on stocks, transportation costs or 
retraining costs depend on flows. Recruiting policy and 
promotion policy depend in the short term on present stocks 
and in the long term on how we model future stocks and flows. 
Determining the feasibility of a retirement plan and evaluating 
the effects of a change in billet structure are other instances 
in which the planner needs to be able to model stocks and 
flows in a manpower system. 

It should be noted that the Markov-type model is not the 
only method that one might use to model stocks and flows. A 
different method is the "cohort model." Marshall (1973) 
presents a comparison of the Markov-type and the cohort 
models. Another method for modeling stocks and flows is the 
"chain model" presented in Grinold and Marshall (to be pub- 
lished) . A listing of various manpower flow models in U.S. 

Navy (1973) indicates that the Markov-type model is by far 
the most commonly used method in military applications. 

We now define the variables that are used to model the 
stocks and flows in the two-characteristic model. Recall that 
is the set of states associated with grade i, w^ is the 
number of states in T\, and for convenience of notation we 
assume the second characteristic takes on successive integer 
values in grade i. 

In a Markov model the stocks and flows are in general 
random variables. In this thesis we deal only with the 
expected values of stocks and flows. Such a model is called 
a "fractional flow model" because the transition probabilities 
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of the Markov model are in effect treated as fractions which 
direct flows through the system in a deterministic manner. 
Let, 

s..(t) = expected stocks in state (i,j) at time t, 



s i {t) = (s i,i(i) lt) - 



,s i,u(i) (t,) ' 



a 1 x w^ vector of expected stocks in T^, 



s (t) = (s 1 (t) , • • • /S n (t) ) , 
n 

a 1 x £ w^ vector of expected stocks in 



the system. 

By assumption Al, flows into any state in must come 
from a state in either TV or T^_^ . We will also make 
provision in our model for "external flows." The source 
of such flows is unspecified. However, we may consider 
external flows as consisting of people hired into the system. 
The external flows may be deterministic or random, but we 
deal only with their expected values. 

Let, 

d^j (t) = expected flow from states in T^ to state 
(i,j) during period t, a scalar: 

d i (t) = (t) , • • • , d i ^ u ^ i) (t) ) , a 1 x vector; 

f^j (t) = expected external flow into state (i,j) during 
period t, a scalar; 

f ± (t) = ( f if 1 (i) (t) # * * * • f i, u (i) (t) > ' a 1 x w i vector; 

g^j (t) = expected flow from states in T^_^ to state 
(i,j) during period t, a scalar; 
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g.j_ (t) = (g ± x (i) (t) , • • • , 9 i/U( i) (t) ) , a 1 x vector. 

When i = 1, g^j (t) is defined to be zero. 

The relation between the flow vectors and the stock vector 
in grade i is depicted in Figure 1 where "T^;t" denotes the 
states in grade i at time t. 




Figure 1. Stocks and F love in Grade i in Period t. 

B. BASIC STOCK EQUATION 

By assumption Al and the interpretation of external flows: 
s i (t) = d i (t) + f i (t) + g i (t). 

See Figure 1. 

It will be convenient to define, 

Sg(t) = 0, a vector of zeroes, 

P Q = 0, a matrix of zeroes. 

Using conditional expectation we then have 
d i (t) = s i (t-l)Q i , i = 1,***, n, 

g ± (t) = s i _ 1 (t-l)P i _ 1 , i = I,**’, n. 

The basic stock equation is then, 

s. (t) = s. (t-l)Q. + f . (t) + s. (t-l)P. , 1 < i < n. (1) 

1 1 1 1 -L.L 1 -L 

The basic stock equation for grade i can be written in 
terms of the expected or actual stocks in grade i in previous 



40 



periods. By recursively applying the basic stock equation 
for s^(t), s^(t-l),***, s i (1) one obtains 



s ± (t) 






t - 0/1,2, * * * , (2) 

i = 1 , • • • , n , 

which we will refer to as the cumulative stock equation . 

Equations (1) and (2) are used frequently in the remain- 
der of this report. Some manpower models used in the U.S. 
military for short-range forecasting consist principally of 
an application of an equation similar to (1) . 



C. TRANSIENT PROPERTIES OF THE STOCKS 

in this section we develop a method for expressing the 
stock vector as a sum of a "steady-state" component and a 
"transient" component. This method helps one to understand 
how the stock vectors will change in going from any present 
stock vector to future stock vectors. This method also helps 
one interpret the character of the limiting stock vector. 

We do not want to restrict ourselves to cases in which 
the stock vector converges (as t increases) to a finite 
vector, so it is best to specify what is meant by a "steady- 
state" component of the stock vector. We say that the vector 

/ V 

function s^(t) is the steady-state component of the stock 

vector s^(t) if, 

lim (s i (t) - s i (t) ) = 0 . 
t->°o 

For any sequence of stock vectors <s^(t)> there is more than 
one choice of the steady-state component. In applications 
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one would prefer a steady state component having a relatively 
simple mathematical form. We show that in some cases a 

r\j 

judicious choice of s^(0) makes this possible. The following 
theorem shows the properties of a class of steady-state 
components which can be quite useful. 

r\j 

Theorem . For any collection of 1 x w^ vectors s^(0), 

% 

i=l, • • • ,n, let the vector functions s^(t) satisfy 

s ± (t) = s i (t-l)Q i + f i (t) + s i _ 1 (t-l)P i _ 1 , t=l , 2 , - • • , 

i=l, • • • ,n. 

<v» 

i.e., the vector functions s^(t) satisfy the basic stock 
equation (1) . Then. 

(a) the actual stocks at time t are 

i 

s i (t) = ,s.(t) + ^(s k (0)-s k (0))M k .(t) , 

00 i 

(b) l ( s . ( t ) — s . ( t ) ) = l (s. (0)-s. (0))B..N. , 

t=0 1 k=l K 

a 1 x w^ vector having finite 

• components, 

'Aj 

(c) s^(t) is a steady-state component of the stock 
vector s^(t), i.e., 

lim (s . (t) -s . (t) ) =0. 

t+oo 

Before proving the theorem it will be worthwhile explaining 
why one might be interested in such a theorem. Part (c) of the 
theorem says that s^t) is a steady-state component of the 
stock vector s^(t) , and part (a) shows how the stock vector 
s^(t) can be expressed as the sum of a steady-state component 
and a transient component. Part (b) of the theorem says that 
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the total over all periods of the difference between the stock 
vector and its steady-state component is a readily calculated 
finite vector. 

Such information can be useful when long-range planning 
has been done using an "equilibrium model." As an example 
consider an organization which intends to change from its 
present size of 250,000 to a size of 200,000. The manpower 
planner may use an equilibrium model to develop policies 
that are in some sense optimal, and these policies will main- 
tain the size of the organization at 200,000 people once it 
has been reduced to this size. So the equilibrium model tells 
the planner what to do once the size of the organization 
reaches the desired equilibrium level but it doesn't tell him 
how to change the size of the organization from its present 
level (250,000) to the desired equilibrium level (200,000). 
This problem of finding an optimal transition policy to go 
from present stock levels to a future equilibrium stock 
distribution is a difficult one. One method for making the 
transition is to immediately implement the hiring, promotion 
and attrition policies that have been derived from the 
equilibrium model. Because of the transient nature of the 
system (see assumption A2) these policies will eventually 
bring the stocks in the system to their equilibrium levels. 

In the theorem the vector functions s^ (t) play the role 
of what the stocks would be at time t if the system were in 
equilibrium. The stock vectors s^(t) indicate what the 
stocks will be at time t if we start with the present stocks 
s ^ (0) and implement the policies of the equilibrium model 
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(which are reflected in the external flows, f ^ (t) , and the 
transition matrices Q^, and A^) . From part (a) of the 
theorem we may readily calculate the difference between 
actual stocks and equilibrium stocks in any grade and any 
period. If there is a penalty associated with having more 
people than the equilibrium stocks in the system, then part 
(b) of the theorem may be used to calculate the total penalty. 
Part (c) of the theorem assures the planner that the differ- 
ence between the actual and equilibrium stocks does converge 
to a zero vector as the time parameter t increases. 

The proof of the theorem follows. 

Proof. By hypothesis the vector functions s^ (t) satisfy the 
basic stock equation (1) , so they must also satisfy the 
cumulative stock equation (2) : 



. . t-1 t-1. 

s. (t) = s . (0)Q. + l f.(t-r)Q. r + i 

1 r=0 1 1 r=0 



®i-l (t-r-1) P i_fQ 



Of course the stock vectors s^(t) also satisfy the cumulative 
stock equation (2) , so we have, 

s i (t)-s i (t) = (s i (0)-s.(0))Q i t 



t ; 1 ^ r 

+ l (s i _ 1 (t-r-l)-s i _ 1 (t-r-l))P i _ 1 Q i r . 

r=0 

When i=l this implies, 

s^t) = s 1 (t) + (s^OJ-s^O) 

= s 1 (t) + (s 1 (0)-s 1 (0))M 11 (t) , 

so we have shown that part (a) of the theorem is true when 
i=l. Suppose part (a) of the theorem is true for grade i-1, 

i . e. , 
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i-1 



- + k | 1 (s k (0 >-=k'°») M k,i-i (t > 



Then 



i-1 



‘i-l't-r-il-Si-ift-r-l) = J i (s k ( 0 )-s k (°)) M k/ 1 .i(t-r- 1 ) 



and. 



s i (t)-s i (t) = (s i (0) -sL (0) )Q^ t 



+ X j! (S k {0 ) -\ ( 0 ))> 1 k,i-l (t - r - 1)P i-l Q i r 



r=0 k=l 
= (s i (0)-s j .(0))Q i t 
i-1 



t-1 






From Equation (10) in Section E of Chapter II, 

Jo M k,i-i (t - r - 1)p i-i °i r - M ki(t) - 
so we have shown by induction that, 

s i (t)-s i (t) = (s i (0)-s i (0))Q i t + 1 ); 1 (s k (0)-s k (0))M ki (t) . 

k=l 

This proves part (a) of the theorem. 

From part (a) , 

s i (t)-s i (t) = l (s k (0)-s k (0))M ki (t) , 
k= 1 



I (S. (t)-s. (t) ) = l l (s.(O)-s k (0))M .(t) 
t=o 1 1 t=0 k=l k • kl 



= l (s (0)-s (0) l M..(t) 

k=l k k t=0 K 



= k | 1 (s k (0) "®k (0))B ki N i ' 

a 1 x vector having finite components. 
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The last step above follows from equations (6) and (8) of 
Chapter II. This proves part (b) of the theorem. 

Part (c) follows from the fact that the sum in part (b) 
is finite. fj 

The utility of this approach depends on our ability to 
find vectors s^(0) such that the vector functions s^(t) are 
simple and readily calculated. The following subsections 
present examples. 

1. Fixed External Flows 

The equilibrium models previously mentioned enjoy some 
popularity in military manpower planning in the United States. 
The rationale underlying the use of such models is that one 
should determine the organization structure and the policies 
to maintain this structure which are optimal. Among, the 
policies derived from an equilibrium model is the hiring 
policy. This has the form, 

f^t) = f i , t = 1,2, • • • , i = 1, • • • ,n , 
where the vector of the number of people to be hired into the 
states in grade i each period, f^, is specified from the 
equilibrium model. 

Define, 

Sf ( 0 ) = fj^ . 

Then using (1) it is easy to show that 
s^ft) = for all t. 

Thus, from the theorem 

s 1 (t) = s^t) + (s 1 (0) ~s 1 (0) )M i;l (t) 

= f 1 N 1 + (s 1 (0)-f 1 N 1 )Q 1 t . 
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Now recursively define. 



f\j 'V 

s^(t) ^1^1^ 

s^ — ( f ^ t s -^P^_ ^ , i - 2 , • • • ,n . (3) 

It is straightforward to verify that these sL satisfy the 
basic stock equation (1) , so we have from the theorem, when 



f ± (t) = f. , 



s i ( t) = S i + l ( s k (°)"S k )M ki (t) 
k*" 1 



The steady-state component can also be written, 
i 



i = jiVkA' i = i '---' n 



( 4 ) 



i 

1 x w^ vector, so the 

limiting vector of stocks in grade i must be a nonnegative 
combination of the rows of N^ . In general, then, not all 
nonnegative 1 x w^ vectors are possible limiting stock 
vectors under constant external flows. 

2 . Linear Growth of External Flows 

In this section we consider the case in which the 
number of people hired into each state increases by the same 
amount each period. Such a hiring policy may not be natural 
over a long period of time, but it may provide a simple 
approximation to planned hiring policies. 

Let the 1 x w^ vector f^ be the amount that the number 
hired into states in grade i increases each period. Then 
the external flow vector for grade i is, 

f i (t) = tf i , t = 1 / 2 , • • • , i = 1 / ' ‘ * / n . 



Note that J f ,B, . is a nonnegative 
k=l K K1 
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Let 



s 1 (0) = f 1 N 1 Q 1 N 1 . 

Let the vector function s^t) satisfy the basic stock 
equation (1) , 

Si ( t) = s 1 (t-l)Q 1 + f x (t) . 

Using the identity N^Q^+I=N^ one can show that 
Si ( t) = tf 1 N 1 - . 

Thus from the theorem, 

s^t) = tf^ - f 1 N 1 Q 1 N 1 + (s 1 (0) + fjN^N^Q^ . 

We note that s^(t) is of the form 
s^ (t) = tL^ + 

where L^ = f^N^ is a 1 x vector, 
and is a 1 x w^ vector. 

Consider some grade ie{2,***,n}. Suppose that 
s i-l (t) = tL i-l + C i-1 ' 



where L. . and C. . are 1 x w. . vectors, 
l-l l-l l-l 

Using the identify 

(tf .N.-f .N.Q.N. )Q. + (t+l)f. = ( (t+l)f. N.-f .N.Q.N. ) , 
ill ii i i l ii 1111 

one may show that if 

V*) * t£ i K 'i- f 1 N i Q i N i + =i-i< t - 1)p i-i N i- L i- 1 p i-i N i Q i N i • 

% 

then s^ (t) satisfies the basic stock equation (1). Note 

'x, 

that s^(t) has the form. 



s L (t) = tL ± + C i , 



(5) 
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where , 



L i 



f i N i + L i-i p ±-i N i 



< f i + Li-i p i-i> N i ■ 



and 



( 6 ) 



c = -< f i + L i-i p i-i> N A N i - < L i-i - c i-i> p i-i«i 

■ - ((L i-i - c i-i> p i-i + t i N i Q i )N i • 

Thus we have shown that when the external flows grow linearly 
the steady-state component of the stocks also grows linearly. 
By recursive substitution in (6) we have, 

L i - JiW’i • 



Note that this vector gives the expected number of visits 

to states in grade i of f^=f ^ (t+1) -f j, ( t) entrants in grade 

k, k=l, ***,i. That is, the growth in the stocks in grade i 

each period, L^, equals the expected number of visits to 

grade i of the growth in the external flows each period in 

the grades less than or equal to i. 

Both L. and C. have the fundamental matrix N. as a right 
11 i 3 

factor, so the steady state component of the stock vector, 

<v 

s^(t), must be a nonnegative combination of the rows of N^. 
This same result was observed in the case of constant 
external flows. 

In summary we have shown that by choosing 
s. (t) = tL. + C. 

l l l 

where L. = f.N. when i=l, 
ill 



= < f i + i-i-i p i-i )N i ■ 



i=2 , 



, n 
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and C. = -f..N,Q,N, when i=l, 

i 1 11 1 

“ - (<L i-l ' C i-l )p i-l + f i N i Q i )N i' i' 2 '"-'"- 
then from the theorem the stock equation may be written 

i 

s^t) = s i (t) + l (s k (0)-s k (0) )M ki (t) . 

k 1 



3. Geometric Growth of External Flows 

In this subsection we will show that geometric growth 
of external flows leads (eventually) to geometric growth of 
the stocks. Geometric growth is a not uncommon phenomenon 
both in natural and man-made systems. Geometric growth is 
frequently a reasonable assumption for medium- to long-range 
planning in manpower systems. 

We consider the case in which the external flows 
into the states in grade i are proportional to a known vector 
f^ and grow geometrically at a rate 0^. Thus, 

f i (t) = 0 i tf i' = 1 ' 2 '***' 

i = 1, ,n 



e. > o. 

1 

When 0 < 0^ < 1, the external flows contract rather than grow. 

If 0 k is not an eigenvalue of for k <_ i £ n we may 



define. 



w 




If the states in grade i have the 0-1 visiting property then 
all eigenvalues of are zero and thus © k > 0 is never equal 
to an eigenvalue of in this case. 
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The following identity will be useful: 

VV = Ver Q i >r - 

r=0 k 

From this it follows that 

N i (0 k )Q i = 0 v(-I+N, (0 V ) ) ♦ 



Define , 



i ' k 



s x ( 0 ) = f 1 N 1 0 1 ) . 

Then it can be shown that if 



s l (t) = 0 1 f l N l (0 l ) * 

'X. 

then s^(t), t=0,l,***, satisfies the basic stock equation, 
and from the theorem, 

S l (fc) = 6 l t f l N i( 0 i) + (s 1 (O)-f 1 N 1 (0 1 ))M 11 (t) . 

Note that the steady state component of the grade 1 stock 
vector grows geometrically at the same rate as the external 
flows into grade 1. 

Define , 

B ki<V - Y N m ( 8 k> P m>- 1 i k i i i n - 

m=k 

Then it can be shown that if 
i 



.<t> = I \ 

1 k=l * 



t- (i-k) 



f k E ki ( 9 k )N i ( V 



then s^(t), t=0 , 1 , • • • , satisfies the basic stock equation (1) 
Note that in the limit the stocks in grade i grow geomet- 
rically at the rate of the largest 0 k where k£i. 

Define, 

6 M = max {0 k ; k=l, • • • ,i} . 
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The steady-state component of the stock vector is not in 
general a nonnegative combination of the rows of (as was 
the case with constant external flows and linear growth of 
external flows) . Rather the steady-state stock distribution 
is a nonnegative combination of the rows of The 

rows of need not be nonnegative combinations of the 

rows of N^, so the limiting stock distributions that are 
possible under geometric growth of external flows need not 
be the same as the limiting stock distributions under constant 
external flows and linear growth of external flows. 
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IV. THE (GRADE, LOS) MODEL 



The (grade, LOS) model is a model of a graded manpower 
system in which the second characteristic is length of 
service (LOS). By a person's length of service we mean 
the number of periods that he has been in the system. 

A. BACKGROUND 

Various manpower planning models that account for both 
grade and LOS are presently used by the United States military 
services. The incorporation of length of service into 
military manpower flow models is important in order to 
realistically analyze the policies of enlisted contracts and 
"retirement at 20." 

A person enlisting in the military service "signs on" 
for a number of years, usually two to five years. In the 
past, approximately eighty percent of those entering enlisted 
military service left the system upon expiration of their 
initial contract. This high attrition at initial contract 
expiration has had many effects, but two that are pertinent 
here are: 

1) There is a close relation between attrition and length 
of service for men serving on their initial contract. 

2) At least half the people in the ‘system are serving 
under their initial contract, so it is generally quite impor- 
tant that this group be modeled accurately. 
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The inclination of the operations researcher may then be 
to model manpower flows by a Markov-type model in which the 
states are lengths of service. Such models are structurally 
simple and computationally efficient. The author has worked 
on interactive retraining models of this sort for the U.S. 
Marine Corps. See also Grinold, Marshall and Oliver, 1973. 
But in practice manpower flow models are typically only part, 
albeit a crucial part, of larger planning models. Budget 
planners and operational planners usually demand manpower 
projections aggregated by pay grade; the distribution of 
people by their length of service is usually of secondary 
interest. 

Faced with the demand for a model that aggregates by 
pay grade and knowing that a valid model must treat the 
effects of length of service the operations researcher is 
led to conclude that a (grade, LOS) type of model is 
appropriate . 

The foregoing discussion does not imply that the (grade, 
LOS) model is appropriate only for military organizations. 
Various members of the "English school" of manpower planning 
have investigated the appropriate distribution function for 
the length of time that a worker stays with a company 
[Silcock, 1954; Lane and Andrew, 1955; Bartholomew, 1959]. 

The constant failure rate distributions (exponential/ 
geometric) were found to be quite inadequate, indicating that 
attrition rate and length of service are indeed related. 
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B. DEFINITIONS AND DESCRIPTION OF THE MODEL 

Let a person's LOS be the number of times the person has 
been counted in the system. Recall that under our timing 
convention a person who enters the system during period t 
is counted for the first time at the end of that period (at 
time t) , and is assigned an LOS of 1 at that time. A 
person's LOS increases by one for each successive end-of- 
period that he is counted in the system. 

It is assumed that once a person leaves the system he 
never returns. It is possible to modify the (grade, LOS) 
model to allow for departure and re-entry by assigning to 
each grade dummy states in which the LOS remains constant 
from one period to the next. In the model described here 
the "out of the system" state is treated as absorbing. 

The states of the system are defined by couples (i,j) 
where : 

(i,j) = the state corresponding to grade i and LOS j. 

The notation and results for the general two-characteristic 
model apply directly to the LOS model. In particular 
l(i) and u(i) are the lower and upper lengths of service for 
anyone in grade i. 

By definition as long as a person remains in the system 
his LOS must increase by one each period. (This is an 
example of a model in which all transient states have the 0-1 
visiting property.) Consequently, we need define only the 
following transition probabilities: 

q^_. = probability a person in state ( i , j ) at the end 
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of one period will be in state (i,j+l) at the 
end of the next period, 

= probability a person in state (i,j) at the end 
of one period will be in state (i+l,j+l) at the 
end of the next period, 

a^j = probability a person in state (i,j) at the end of 
one period will be out of the system at the end 
of the next period. 

By assumption A1 : 



q.. + p.. + a.. = 1. 
*11) 13 



The transition matrix A. is a w. x 1 matrix: 

1 1 



A i ° ta i,l(i)'---' a i,u(i) ) - 
The transition matrix is w^ x w^ and has non-zero 
elements only immediately above the main diagonal: 




( 1 ) 



The transition matrix is w^ x w^ + ^ an< ^ ^ as non-zero 
elements only on a single diagonal band. If 1 (i+1) j> 1 (i) +1 
and u (i+1) ^u(i)+l, then has the form shown below, where: 

1) the first max {0,1 (i+1)- (1 (i)+l)> rows are zeroes 

2) the last max { 0 ,u ( i+1) - (u (i) +1) } columns are zeroes. 
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Thus 



P. 

1 



0 

I 

i 

i 

P i, 1 (i+1) -1 

0 p 



i , 1 (i+1) 



P i,u (i) 



If l(i+l) £ 1 (i) / the first 1 (i) +1-1 (i+1) columns of 
are zeroes. If u(i+l)£u(i), the last u (i) +l-u (i+1) 
of P^ are zeroes. Under any circumstances is a 
"diagonal matrix," (see Appendix B) . 



0 • • • 0 

P . 
i 

rows 



(2) 



Define, 

r+k-1 

q.(r,r+k) = £ q.. if k^l, 

j=r J 

= 1 if k=0 

= 0 if k<0 . 

Then for k=0 , 1 , • • • , w^-1 we have 
0 ... 0 q i (l,l+k) 

0 q^ (1+1, 1+1+k 

q^ (u-k , u) 
0 

0 
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where 1 = 1 (i) , 
u = u (i) , 

and the first k columns and the last k rows are zeroes. 

V ST 

When k^>w^, =0. Recalling that 

N 1 - - Oil' 1 




we have in the LOS model, 



N. = l 
i r 



w . -1 

l 



k=0 



1 qi (l,l+l q ± ( 1 , 1+2 ) 

1 q^l+1,1+2) 



q ± (ifU) 
qi (1+1, u) 
q ± (u-l,u) 
1 



(3) 



The notation hides the rather simple structure of the 
fundamental matrix Nj in the (grade, LOS) model. Consider 
the case where l(i)=l and u(i)=4. We then have for grade i. 




N . = 

l 




‘il 




q il q i2 q i3 



q i2 q i3 



J i3 



(4) 



It is interesting to compare the structure of IsL in the 
(grade, LOS) model with the structure of a matrix, denoted 
B, of the submatrices from the general two-characteristic 

model: 
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B = 



B 



11 



12 


B 13 


B 14 


22 


B 23 


B 24 




B 33 


B 34 






B 44 


of 


Chapter 


II, 



k-1 

n i 

r=i 



B.,= II B , a w. x w, matrix , 
ik r 1 k 



so the matrix B can be written, 
I B, 



B = 



B 1 B 2 



B, 



B 1 B 2 B 3 



B 2 B 3 



B. 



(5) 



Recall that each element of B^ gives the probability of 
entering grade i+1 in a particular state given the state in 
which grade i was entered. Thus, the matrices 3^ summarize 
information about transitions from one grade to the next 
just as the . ' s summarize information about transitions 
from one LOS to the next. In Chapter VI we consider combining 
states in a two-characteristic model; the foregoing discussion 
indicates that if all states in a grade are combined then the 
information originally summarized in the w^ x w^ +1 matrix B^ 
must now be summarized by a single number, i.e. a lxl matrix 
B^ in the model that results from combining states. 

In manpower modeling the term cohort usually refers to a 
group of people who enter the system at the same time and in 
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the same state. For example the group of freshmen entering 
a college in a particular year comprise a cohort. After 
the cohort has entered the system we use the term cohort to 
refer to the members who are still in the system. 

Let us consider the movements of cohorts in the (grade, 

LOS) model when entry to the system is restricted to state 
(1,1) • The cohort which enters during period t^ must, at any 
subsequent time t^ + t, be in the set of states (i,j) such 
that j = t+l. Furthermore any cohort which entered the system 
during period 7* t^ cannot at time t^ + t be in any of the 
states (i, t + 1) . Thus when entry to the system is restricted 
to a particular state, e.g., state (1,1), the (grade, LOS) 
model preserves cohorts in the sense that members of different 
cohorts never simultaneously occupy the same state. The motion 
of a cohort through the system is clearly described by the 
t-step transition matrices. 

Recall that the t-step transition matrix from states in 
grade k to states in grade i, M^(t), has elements: 

M(t; (k,m) , (i,j) = probability of being in state (i,j) 

t steps after being in state (k,m) . 

The row index of M ki (t) is r = m - 1 (k) + 1; the column index 
of M^(t) is c = j - l(i) +1. Because LOS increases by one 
each period, if 

M(t; (k,m) , (i, j) ) 7* 0, 

then j - m = t. 

But j - m = t implies that 

(c + 1 (i) -1) - (r +1 (k) -1) = t, or 

c - r = t + (1 (k) - 1 (i) ) . 
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Thus the non-zero elements of are in row r and column c 

such that 

' c - r = t + ( 1 (k) - l(i)). (6) 

so M k j.(t) is a "diagonal matrix" with index t+ (1 (k) -1 (i) ) . 
(See Appendix B.) Furthermore the non-zero diagonal of 
M k ^(t+1) must be immediately above the non-zero diagonal of 
M^(t). Recalling that. 



CO 

£ M, • (t) = V, . , (see equation (8) of Chapter II), 
t=0 1 Kl 



we see that the non-zero diagonal of (t) must be the 
set of elements of in row r and column c satisfying (6) . 

Thus in the (grade, LOS) model all t-step transition matrices 
can be readily derived from the matrices V, . . 



C. COMPUTATIONAL CONSIDERATIONS 

Because of the sparseness of the and P^ matrices it 
would be most inefficient to store and manipulate the entire 
matrices. In Appendix B the storage and multiplication of 
"diagonal matrices" is discussed. A "diagonal matrix" is 
defined as any matrix (not necessarily square) having the 
property that for some k the element in row i and column j 
equals zero if i-j^k. The matrices and P^ in the (grade, 
LOS) model are diagonal matrices. 

It is shown in Appendix B that the product of diagonal 
matrices is a diagonal matrix. Only the elements of the non- 
zero diagonal and four numbers describing the diagonal 
matrix need be stored. To multiply an rxm diagonal matrix 
by an mxc diagonal matrix requires min{r,m,c) multiplications 
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as opposed to the product of r,m and c multiplications and 
additions in the general case. Consequently, in the (grade, 
LOS) model the fundamental matrix can be computed 
efficiently by computing the non-zero diagonals of successive 
powers of and combining these with an identity matrix to 
form the fundamental matrix N^. 



D. EXAMPLE: MINIMIZATION OF COSTS AT REENLISTMENT 

In this section we consider an example in order to 
illustrate how one might take advantage of the highly struc- 
tured form of the (grade, LOS) model to solve a practical 
problem. 

The U.S. Navy has used various reenlistment bonus plans to 
decrease the attrition rate at termination of the initial 
enlistment contract. The amount and effectiveness of the 
bonus depends on a number of factors; among them is pay grade 
at contract termination. 

We consider the case in which initial contracts are for m 
periods, and a reenlistment bonus is used to control the 
attrition rate a. for such grades k that a^ m is defined. 

To simplify notation it is assumed that a^ is defined for 
k=l,***,n. The cost of changing a^ m is specified by a non- 
negative quasi-convex function. Let t> e the cost of 

a Aa, decrease in a. from its base value, and let c. (0) = 0. 
Decisions are constrained by the requirements that in 
equilibrium the total stocks in each grade, s^l, must be no 
smaller than a specified lower bound s^ . All external flow 
is into grade 1 and f^ is specified. 
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Because under Al, 



1 km 



^km + ^km + a km ^ ' 

if is decreased then changes in either p^ m or q Um must 

occur. 

Let, 

Aq, = the increase in q, , 

M km M km ' 



Ap, = the increase in p, 
km A km 



Then , 

Aq lan + A Ptar. = ia kra • < 7 

The various transition matrices depend on the values of 
the and Ap^'s, and we denote this by Q k ^^km^ ' 

NktAq^), P^ (Ap^m) and B k ^^km^ * To s i m Plify notation some- 
what we define, 

Q k =Q k (0), 

N k = V°>' 

P k = P k (0), 

B k = B k (0,0) . 



The parameters of the matrices and are known, and N 

and B, arc calculated from them, 
k 

From the results of Chapter III we have 



i-1 

s i - f l r f 1 B r (i< 3rm' AP ri» )N i <Aq i m ) 

In the case &q km = 4p km = Aa km = 0 
stocks by s^ Q and, 



( 8 ) 



for all k we denote the 



s 



iO 



i-1 



f. n b n. . 

1 . r i 

r=l 



(9) 



63 



To minimize the total costs of the reenlistment bonuses 
while satisfying the constraint that total stocks in grade i 
be no smaller than s^ we must solve the following program 
denoted PI. 

PI) min I c k (ia km ) 



ST S..1 > s. , i = l,***,n 
i-1 

Si = fl r"l Br<Aqrm ' iP rni )n i ( ' lq ira ) ' i = 1 '‘"' n 
Aq km + A Pta ‘ Aa km ' k ” 

It will be shown that because of the structure of the 
(grade, LOS) model the constraints imposed by equation (1) 
are linear functions of the decision variables in program 
PI. That is, program PI requires the minimization of a quasi- 
convex function subject to linear constraints. 

The linearity of the constraints is shown in two steps. 
First, we show that they are linear when only one of the 
decision variables is non-zero. Next, we show they are linear 
for all feasible values of the decision variables. 

VJhen Aq, = Ap, =Aa. =0 we denote the values of the 
^km ^km km 

parameters q^,p^ m and a Vm with hats, i.e.. 



km 



^km ^km + ^km ' 



^km ^km + ' 



km 



a km a km ^ a km ' 



The value of s^ depends on q^ only through ^^km^ ' 
Let , 



j = m - 1 (k) +1 . 
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Then the only elements of N j c (^ < 3j cm ) that are functions of 



*km 



are those in row r and column c such that r £ j and 



c> j. Each of these elements has q. as a factor. Let 

km 

N k m be the Wj. x matrix that results from: 

1) setting equal to one in those elements of N^. in 



which 


it is a factor, 






2) setting all other elements of 


equal to z 


For illustration let 1 (k) 


=1, u(k) = 


5 and m=3 . 


=3-l+l=3 . 


The fundamental matrix is 




V A< W - 


. 1 

1 q kl q kl q k2 | q kl q k2 q k3 q kl q k2 q k3 q k4 




1 q k2 1 


q k2 q k3 


q k2 q k3 q k4 




1 1 
L 


q k3 


q k3 q k4 




1 


1 ^ 

! 

i 


1 


m 




i 

H 



where q k3 = q k3 + Aq kJ . 

The elements having q^ as a factor are in the indicated 
submatrix in the upper right corner of K^Aq^). Setting 
q ^3 equal to one in these elements and setting the other 
elements equal to zero we then have 



N 



m 



0 0 
0 0 
0 0 
0 0 
0 0 



0 

0 

0 

0 

0 



q kl q k2 



*k2 



1 

0 

0 



q kl q k2 q k4 

q k2 q k4 

q k4 

0 

0 
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From this illustration we observe that 



N k< 4, W - N k + 4< W J k m • 



m 



Note that N, is not a function of q 



km * 



Thus, if Ac 3 lm = Ap lm = A a lm =0 for i=l,***,n except that 
^km^O ' then the stocks in grade i are from (8) and (9) 



s i (4, W - f 1 ( i B r )Wk*iq km »k"' p k ll l 1 ¥i 

r=l r=k+ 1 



k-1 



“ s i0 + 4 < Wi ( VP'W " B r N i 

r=l r=k+l 



i-1 



Define, 



k-1 



rru 



i-1 



d ik = f l ( 11 B r } (N k P k } 11 Vi 1 ' a scalar - (10) 

r=l r=k+l 

Note that d^ is not a function of q^ m or of any 

or Ap^ m . So we have shown that when all Aq-^'s and Ap^'s 

except Aq^ m are zero then the total stocks in grade i are a 

linear function of Aq, : 

^km 

s.(Aq, )1 = s . „ + d., Aq, 
l M km lO lk ^km 

We now show the total stocks in grade i are also linear 
in Ap^ m when all other Aq-^'s and Ap^'s are zero. 

The value of s^ depends on Ap^ only through Pj^ ( Ap^) • 

Let Pj, m be the matrix that results from setting p^ m equal 
to one in Pj. (Ap km ) and setting all other elements of P^Ap^) 
equal to zero. Then 

p k' 4 Pk i »> = ?k<°> + 4 Pkm-V 



= P, + AP, P, 
k km k 



m 
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Now if Aq lm =Ap lm =Aa lm =0 for 1=1,- ",n except p, ?-'0, 



then the stocks in grade i are 

k-1 



i-1 



i Up km> " f l< J. “ B r N i 

r=l r=k+l 



k-1 



i-1 



= s. + Ap f ( n B )N,P, m n BN. 

lO r m 1 , r' k k , r 1 

r=l r=k+l 



Define, 



k-1 



i-1 



= f, ( n B )N, P, m n B N . 1 . 



ik 



1 r-1 r k k - 



r=k+l 



r l 



(ID 



Note that e^ is not a function of p Vm or of any Aq lm or 



km 



‘Ira 



Ap 



Ira 



So we have shown that when all Aq^'s and Ap^'s except Ap^ 
are zero then the total stocks in grade i are a linear 
function of Ap^: 

s . ( Ap, ) I = s . ^ + e . , Ap, 
l ^kra 7 iO ik r km 

We call N, m the m-dif f erential matrix of N, and P, m the 
k k k 

m-dif f erential matrix of P, . 

k 

The following lemma is used to show that when more than 



one Aq^ or Ap, (or both) are non-zero, the total stocks are 
still linear functions of the Aq^'s and Ap^'s. 

i-1 

Lemma. If in the product II B^SL , where B^N^P^, any 

k=l 



two (or more) distinct matrices are replaced by their m- 
diff erential matrices, then the product is a zero matrix. 

Proof. Because all states have the 0-1 visiting property 

i-1 

in the (grade, LOS) model, II B^N^=V^, the w^ x w^ matrix 
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of probabilities of visiting a column state in grade i 
starting from a row state in grade 1. Each of these proba- 
bilities is the sum of the probabilities of all paths 
through the state space which start in the row state and end 
in the column state. 

Replacing by times its m-dif f erential matrix Q^ m 

is equivalent to extracting all terms in which have q^ 

as a factor. This in turn is equivalent to restricting the 
paths from the row state to the column state to those which 
pass through both states (k,m) and (k,m+l) . Similarly, 
replacing by times its m-dif f erential matrix P^ m is 

equivalent to restricting the paths from the row state to the 
column state to those paths which pass through both states 
(k,m) and (k+l,m+l). Consequently, replacing two or more 
i-1 

matrices in II B, N.=V. . by their m-dif f erential matrices 
k=i k 1 11 

restricts the process to paths from the row state to the 
column state that have probability zero. (J 

Thus we have. 
i-1 

s. = f. II B (Aq , Ap )N. (Aq. ) 
l 1 . r Tm' ^rm' l ^im 

r=l 



i-1 



= f. n (N +Aq N m ) (P +Ap P *“) (N.+Aq. N.‘“) . 
1 , r ^rm r r r rni r i im i 

r=l 



m. 



m. 



From the lemma all of the "cross product" terms, i.e., terms 
containing two or more m-dif f erential matrices, are zero 
matrices, so we have, 
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= f 



i-1 

, n n p n. 

V-o. r r 1 



i k-1 i-1 

+ I Aq.f.. ( n b )N, ra p. ( n 
L , -‘Jan 1 ' r k k 



B )N. 



k=l 



r=l 



r=k+l 



+ 






k=l 



k-1 

^jVVk 



m 



i-1 

( n b )n. 

r=k+l r 



and using (9) , (10) and (11) , 



i i-1 

s i I = s io i + J 1 Aq km d ik + J/P^ik • 

This completes our demonstration that the constraints 
are linear in the decision variables. Thus the following 
program P2 is equivalent to the original program PI: 

P2] ■ni„J i C k (Aq km+ Ap km ) 



ST 



J, Aq kra d ik + 
k=l 



i-1 

^^ Ap km e ik — s i 



s^qI / i— 1 # * * * / n r 



The constraints need not be of the exact form shown in 
this example. The nature of the problem does not change if 
the constraints are of the form 



s.L. > s . . i=l, • • • ,n 
11-1 

where L. is a w. x c. matrix, 
l ii 

s. is a c . x 1 vector, 

l l 

and c. is the number of constraints on stocks in grade i. 

l 

The scalars d.. and e.. would in this case become 1 x c. 

lk lk i 

vectors, and their defining equation would be modified by 
replacing 1 with . 

There are various efficient techniques for minimizing 
a nonlinear objective function subject to linear constraints 
which could be used to solve P2. The exact form of the 
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objective function might indicate the most appropriate 
technique, e.g., separable programming, quadratic programming 
or gradient projection. The point of the example, however, 
is to show that one may take advantage of the structure of 
the (grade, LOS) model to solve problems that would be quite 
difficult in a general two-characteristic model. 

E. INTERCHANGING GRADE AND LOS 

When studying manpower flows with a two-characteristic 
model it is sometimes advantageous to interchange the first 
characteristic, grade, and the secondary characteristic. The 
feasibility of this depends to a great extent on whether the 
interchanged characteristic model satisfies assumption Al. 
That is, in the original two-characteristic model, is the 
value of the second characteristic restricted tc staying the 
same or increasing by one in each period? 

In the case of the (grade, LOS) model interchanging grade 
and LOS does result in a model which satisfies assumption Al. 
We consider some of the properties of this interchanged model 
which we call the (LOS, grade) model. 

Before interchanging characteristics it is convenient to 
expand the state space as necessary to insure that for each 
value of grade i and each value of LOS j there is a state 
(i,j). Recall that in the (grade, LOS) model we only define 
a state (i,j) when 1 (i) £ j £ u (i) . Let u be the largest of 
the u(i)'s. Then in the expanded state space (i,j) is 
defined for 1< j<u for i=l,***,n. 
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Now let us consider the structure of the submatrices 
in the overall transition matrix for the (LOS, grade) model. 

For any LOS j the n x n matrix contains the transition 
probabilities for going from one grade to another while 
maintaining LOS constant at j . By the definition of LOS it 
must increase by one each period, so such probabilities are 
zero. Consequently, for j=l,***,u , 




and, 

N. = (I-Q . ) _1 = I . 
3 3 



For any LOS j the n x n matrix contains the probabilities 
of going from one pay grade to another (or the same pay grade) 
while increasing LOS to j+1. It is here that we can see a 
possible advantage of the (LOS, grade) model since we can 
allow demotions and multi-grade promotions without violating 
assumption Al. If there are no multi-grade promotions and 
there are no demotions, then has non-zero entries only on 
and immediately above the main diagonal. If demotions are 
included in the model they cause non-zero entries below the 
main diagonal of P^; promotions of any type are reflected in 
non-zero entries above the main diagonal of P^ . Under any 
circumstances , 

B . = N . P . = P . . 

3 3 3 3 



If demotions are not included in the model, then B^ is upper 
triangular and so is the product of successively indexed 

matrices B.. 

3 
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For any LOS j the n x 1 matrix A., contains the proba- 
bilities of leaving the system from the various pay grades 
when length of service is j . 

The overall transition matrix P for the (LOS, grade) 
model has the same form as in any secondary characteristic 
model. The important difference is that in the (LOS, grade) 



P = 



we have 


Q .=0. 
3 


Thus , 




1 

OH 

M 




1 

1 


A 1 


0 


P 2 


1 


^2 




\ 


1 


i 




\ 


\ 


1 




\ 


P- 

/ 


1 

A 






\ “- 1 | 


u-1 






0 


A 








u 



To illustrate the effects of interchanging grade and LOS 
consider an example from the U.S. Marine Corps. The enlisted 
force of the Marine Corps has nine grades. The values of LOS 
range from 1 to 30. So in the (grade, LOS) model v/e would 
have nine 30x30 matrices and eight 30x30 matrices P^. 

Of course each of these matrices would have at most 30 non- 
zero elements. By interchanging characteristics, the (LOS, 
grade) model has all. matrices equal to zero matrices, and 
there are twenty-nine 9x9 matrices P^. Whether it is more 
practical to use nine pairs of relatively large matrices or 
twenty-nine relatively small matrices must depend on the 
problem to be solved. This example does illustrate that the 
modeler has a choice. 
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The fundamental matrix for the transient part of the 



process is, 



u 





u 



I 



p 



2 




N 



u 



I 




I 



The expansion of the state space so that all states in 

{ (i, j) : j=l, * * * ,u , i=l,***,n} are defined is not always 

necessary. In the (LOS, grade) model one may define, 

1 ( j ) = lowest grade that a person having LOS j may hold, 

u(j) = highest grade that a person having LOS j may hold. 

As before w^=u ( j ) -1 ( j ) +1 . Then is w^ x w., P^ is 

w. x w. n and A. is w. x 1. The nature of the model is not 
3 D + 1 3 3 

substantially affected. Computer storage requirements may be 
reduced but computer programming may be more complex. 

We note that the basic stock equation (see Chapter III, 
equation (1)) in the (LOS, grade) model is 



reflecting the fact that the stocks of LOS j in the next 
period do not depend on the stocks of LOS j in the present 
period. The cumulative stock equation (equation (2) , Chapter 
III) is the same as the basic stock equation. The following 
equation is derived from the fundamental matrix and gives 



s^(t+l) = f j (t+1) + s j_ 1 ( t ) p j_ 1 
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the stocks in terms of the external flows in previous periods 



s j (t) = ^, f m (t - ,n+ 3 ) / P k • 

J m=l k=m 

In summary the choice between the (grade, LOS) or the 
(LOS, grade) model depends on the information that is to be 
derived from the model. The (LOS, grade) model is more 
flexible in that the promotion/demotion structure is not 
restricted. The (grade, LOS) model might be more tractable 
when used as part of a budget model since costs are usually 
dependent on grade rather than on LOS. Under any circumstances 
the two models must yield equivalent information. 
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V. THE (GRADE, TIG) MODEL 



The (grade, TIG) model is a model of a graded manpower 
system in which the second characteristic is time in grade 
(TIG) . By time in grade we mean the number of periods that 
a person has been in his present grade. 

A. BACKGROUND 

There are occasions when it is desired to model the 
flows of "career motivated" people in a manpower system. 

The operational definition of a career motivated person 
depends on the case at hand, but generally we expect such 
people to have no predilection to leave the system -at the 
earliest convenient opportunity. Rather we expect the 
decisions of career motivated people with respect to leaving 
the system to be closely related to the recognition they 
receive for their performance in the system. One measure of 
recognition of performance is the combination of pay grade 
and time since last promotion. Such considerations may make 
a (grade, TIG) model appropriate. 

A somewhat different context in which the (grade, TIG) 
model may be appropriate is that in which the organization 
has an "up or out"policy. Positions in an organization may 
be (formally or informally) partitioned between those that 
are on the "path towards the top" and those that are 
terminal positions. The former group often has an up or out 
policy where "out" means transfer to a terminal position; the 
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(grade, TIG) model can be used to examine flows in positions 
in this group. 



B. DESCRIPTION 

A person who enters grade i during period t is counted 
for the first time in grade i at the end of the period (at 
time t) and assigned a TIG of one at that time. The 
person's TIG increases by one for each successive period that 
he is counted in grade i. 

The states of the system are defined by couples (i,j) 
where : 

(i,j) = the state corresponding to grade i and TIG j. 

The notation and results for the two-characteristic model 
apply directly to the (grade, TIG) model. In the (grade, 

TIG) model. In the (grade, TIG) model the value of 1 ( i ) is 
always one. 

We need define only the following probabilities: 

= probability a person in state (i,j) at the end 
of one period wll be in state (i,j+l) at the 
end of the next period. 

p^j = probability a person in state (i,j) at the end of 
one period will be in state (i+1,1) at the end of 
the next period . 

a^j = probability a person in state (i,j) at the end of 
one period will be out of the system at the end 
of the next period. 

Under assumption Al, 

q.. + p.. + a.. = 1. 
il *11 il 
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The transition matrix A^ is a x 1 matrix: 



A . = [a . , , 
1 1 il' 



' a i,u (i) ^ 



The transition matrix is x vn and has non-zero 
only immediately above the main diagonal: 



Q i = 



q il 

0 q 



i2 



0 



\ 

\ 



q i,u (i) -1 
0 



The transition matrix P. is w. x w. and has non-zero 

l il 

only in the first column: 



'il 



P i,u (i) 



0 -- 



I 

0 



- 0 

I 

I 

1 

• 0 



Thus, A^ and have the same form as in the (grade, 
model. Each fundamental matris N. also has the same 

l 



as in the (grade, LOS) model: 



N. = 

l 



1 q il q il q12 



u(i)-l 
n q 

j=l 






q i2 -~ 



u(i)- 1 

ji 4 

j=2 

i 
i 
l 

1 



13 



elements 



elements 



LOS) 

form 
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The matrix B . =N . P . has non-zero elements only in the 
111 2 

first column because grade i+1 can only be entered in state 
(i+1,1). Let the element in row j and column 1 of be 
denoted b^ ^ . Then, 
u(i) k-1 



b..= V ( IT q . ) p . , . 
13 ' m= j lra lk 



th 



Recall that B^l is a x 1 vector, and its j ** component 
is the probability of attaining grade i+1 starting from state 
(i,j). In the (grade, TIG) model. 



V = > b ii- 



,b i,u(i)^ ' 



so. 



b^j = probability of attaining grade i+1 from state (i,j). 

The product of successively indexed B^ matrices has non- 
zero elements only in the first column, and 



k-1 

b. , n b . 
ll . , , ml 
m=i+l 



k-1 

n b = 



. m 



m=i 



k-1 

b. 0 n b . 
i2 . nl 
m=i+l 



k-1 



b . . . . n b , 

i , u (l) . . , ml 

m=i+l 



0 0 

0 0 

: i 

1 I 

0 0 



k-1 

n b 

m=i+l 



ml'®! 1 - °) • 



If < 3j_j = 0f then state (i,j+l) is unreachable (except 
possibly through an external flow). If q^j^O, then, 
b . . -p . . 

b. . = ■ ^ x 3 ... 

i,j+i q. 



ii 
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This relation provides an efficient method for computing 
the elements of the vector B^l recursively: 

b i,u (i) = p i,u (i) ' 

b.. = p.. + q-b. , j=u(i)-l,***,l. 

^13 i,D+l J 

The (grade, TIG) model is equivalent to a discrete semi- 
Markov process. The states of the process are the n grades 
and Tq (out of the system) . Upon entering state i the next 
transition is to state i+1 with probability b^ and to Tq 
with probability 1-b^. The tail distribution of the time 
spent in grade i is the first row of the fundamental matrix 
N^. The results from the two-characteristic model in 
Chapter II, Section E can be used to find the distributions 
of time spent in state i conditioned on either promotion 
or leaving the system before promotion. Thus all the infor- 
mation required to set up the (grade, TIG) model as a discrete 
semi-Markov process is readily derived from the results 
established here. The use of the discrete semi-Markov process 
is covered in some detail in Howard, 1971. Because of 
assumption A1 there seems to be little advantage to treating 
the (grade, TIG) model as a semi-Markov process. If it is 
necessary that demotions and multi-step promotions be 
included in a model based on grade and time in grade, then 
the more general techniques of semi-Markov processes might be 
of value. We show in the following section, however, that 
interchanging characteristics in the (grade, TIG) model 
leads to a Markov model which allows the inclusion of 
demotions and multi-step promotions. This model is 
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computationally practical, but violates assumption A1 , and 
thus is not in the class of models analyzed in Chapters II 
and III. 

C. INTERCHANGING GRADE AND TIG 

Interchanging grade and time in grade results in a model 
that does not satisfy assumption Al (the new first character- 
istic, time in grade, decreases when the new second character- 
istic, grade, changes). Nevertheless, the (TIG, grade) model 
is not too difficult to analyze, and it enables the modeler to 
provide for demotions and multi-grade promotions. 

Let (j,i) denote the state in the (TIG, grade) model 
corresponding to j periods in grade i. As before i is in the 
set {l,***,n}, and we will let j take any value from the set 
{l,***,m} where the value of m is specified. Let be an 
n x n matrix of transition probabilities for transitions from 
states in { ( j , i) : i=l , • • • , n } to states in { (1 , k) : k=l , • • • , n } . 
That is, Pj is the transition matrix for transitions from one 
grade to a different grade when TIG is equal to j . Changing 
grades, of course, causes the value of TIG to change to one. 
The matrix Pj has zeroes on its main diagonal. Let Qj be an 
n x n matrix of transition probabilities for transitions from 
states in { ( j , i ) : i=l , • • • , n} to states in { ( j+1 , i) : i=l , • • • , n) . 
That is, Qj represents transitions in which grade is unchanged 
and TIG increases from j to j+1. The matrix has zeroes 
everywhere except possibly on its main diagonal. Let A^ be 
an n x 1 matrix of transition probabilities for transitions 
from states in { ( j , i) : i=l, • • • ,n} to out of the system. The 
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overall transition matrix P for the (TIG, grade) model is 
shown below: 



P 



m-1 



m 



Q. 



I A, 



Q m-1 | A m-1 

. A 

I 1 



The transient part of the process is represented by the 
submatrices in the upper-left part of P. We call this (as 
before) the transient matrix Q. By assumption A2 the 
process is transient, so the fundamental matrix, N=(I-Q) \ 
exists. Of particular interest is the nxn submatrix in 
the upper-left corner of N which we will denote by N^. The 
rows and columns of correspond to states in 

{ (1 , i) : i=l , • • • ,n} . The basic theorem on the fundamental 
matrix (Theorem 3.2.4, Kemeny and Snell, 1960] indicates that 
the element in row i and column k of equals the expected 

number of visits to state (l,k) starting from state (l,j). If 
the 0-1 visiting property holds (which in the (TIG, grade) 
model means no demotions) , then the element in row i and 
column k of equals the probability of attaining grade k 

given that grade i has just been attained. The submatrix 
may be computed in the following manner. 

Let, 



F = P , 
m m 



F : =p j + Q j F :+i ' 3- 1 ' 



,m-l, an n x n matrix. 
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The element in row i and column k of F^ is the probability 
of ever entering state (l,k) starting from state (j,i). It 
can be shown that, 

N X1 = (I-F 1 )" 1 , 

and that the first column of submatrices of N, an mn x n 

matrix, is [N, , , F-N, ,,*•*, F N, , ] . Most of the first-order 
11 2 11 m 11 

information of interest from the (TIG, grade) model is either 
contained in or readily derived from the first column of sub- 
matrices of the fundamental matrix N. 

In summary, interchanging characteristics in the (grade, 
TIG) model leads to a model having a transient matrix struc- 
ture quite different from that of the two- characteristic model 
as developed in Chapter II. However, first-order properties 
of the interchanged model are not too difficult to obtain. 

The most obvious advantage of interchanging characteristics 
is that demotions and multi-grade promotions may be included 
in the model. 
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VI. COMBINING STATES 



In this chapter we consider some of the mathematical 
properties of the two-characteristic model when the states 
in each grade are combined into a single state. The ideas 
discussed here are closely related to the concept of lumpa- 
bility as presented in Burke and Rosenblatt, 1958, and 
Kemeny and Snell, 1960. There is a difference between the 
approach taken here and that of the foregoing authors. Burke 
et al. are primarily interested in establishing conditions 
under which the combining of states in a stationary Markov 
process leads to a process which is still Markov and sta- 
tionary and has these properties for all (or at least some) 
initial probability vectors. In Section A of this chapter 
we briefly consider conditions under which the states of a 
two-characteristic model are lumpable. In Section B of this 
chapter we will consider combining states with the clear 
understanding that the resulting process may not be Markov 
or stationary for most initial probability vectors. Section 
C contains an example which illustrates the results in 
Section B. 

We restrict our attention to the combining of all states 
in each grade so that the "reduced state space" version of 
the two-characteristic model will have one state for each 
grade and a state for "out of the system." Obviously such a 
combining of states leads to a one-characteristic model. 



83 



A. LUMP AB I L I T Y 



Let <J>(m) be the random function which indicates the state 
that a Markov process is in after m steps. Recall that T\ is 
the set of all states in grade i. Let tt be an initial proba- 
bility vector over all states in the system. Then let 

(1) Pr [<f) (0) eT^] = probability the process starts in T^ 

given the initial probability vector 

fr, 

(2) Pr^ [<j> (m+1) eT t | <p (m) eT g , , <f>(l)eT r , ^(0)eT i ] 

= probability the process is in after m+1 
steps, given the initial probability vector 
7T and the events <j>(0)eT^, 

<P (1) eT r , • • • ,<J) (m) eT g . 

The latter probability is not defined unless the given sequence 
of events has positive probability under the initial proba- 
bility vector ir. 

Let T* denote the partition of the state space T|JTq into 
the sets of states, T^ , T^,***,T n . The foregoing proba- 
bilities (l)'and (2) define a stochastic process on T* , and we 
call this a lumped process. 

Definition . We say that the Markov chain in a two- 
characteristic model is "lumpable" with respect to the 
partition T* if for every starting vector tt the lumped process 
defined above is a Markov chain and the transition proba- 
bilities do not depend on the choice of tt. 

Theorem . A necessary and sufficient condition for the Markov 
chain in a two-characteristic model to be lumpable with respect 
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to T* is that for every pair of sets in T* , e.g., T\ and T^, 
the probability of a one-step transition from any state 
(i,j)eT. to some state in T, has the same value for every 
state in . These common values, denoted by {p^K form a 
transition matrix for the lumped chain. 

The foregoing definition and theorem are taken from 
Chapter VI of Kemeny and Snell, 1960, with paraphrasing 
to suit the case at hand. 

By assumption A1 for any starting vector , 

Pr^Un+l) eT t | <Km) eT g , • • • ,<M1) eT r , <(>(0) eT^ 

can be non-zero only if t=s,s+l, or 0. 

A 

We define the w^ x 3 matrix by, 

P. = [Q.I, P.1, A. I] . (1) 

Corollary 1 . The Markov chain in a two-characteristic model 
is lumpable with respect to T* if and only if for each grade 

A 

i=l,***,n the matrix P^ has identical rows . If the chain 
is lumpable, then the transition matrix for the chain is 




where the vector (p..,p. . . ,p..) is any one of the identical 

3-3- 3_ 9 3- + -L 3- U 

A 

rows of p^. 
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Corollary 2. A necessary condition for the Markov chain in 



a (grade, LOS) model or a (grade, TIG) model to be lumpable 
is that Q^=0, i=l,’:',n. 

Proof. In the (grade, LOS) and (grade, TIG) models u (jj = 0 
by definition. So, 

1) the last component of Q^I is a zero, implying 

A 

2) the rows of are not identical unless q^l=0, implying, 

A — 

3) the rows of P^ are not identical unless Q^=0, implying 
by Corollary 1, 

4) the chain is not lumpable with respect to T* unless 

Q i =0, i=l , • • • , n . [] 

In the (grade, LOS) and (grade, TIG) models if Q^=0, 
i=l,***,n, the lumped process is rather trivial because the 
only sequences of events that have positive probability are 
those of the form <}> (0) eT^ , <J> (1) eT^^, • • • , <f> (m) eT^ +m , or such a 
sequence with events ^(m+kJeT^, k=l,***,r appended. If the 
starting vector tt has positive probability only on T^ or we 
are given that <{>(0)eT^, then the lumped process is Markovian 

i 

but the transition probabilities depend on the value of tt, a 
situation Kemeny and Snell call "weak lumpabili ty . " 

B. LUMPING AN UNLUMPABLE PROCESS 

It may be necessary or advantageous in some circumstances 
to use a one-characteristic model. If the system is more 
accurately modeled by a particular two-characteristic model, 
then we are faced with the problem of lumping a process which 
may be mathematically unlumpable. In this section we develop 
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methods for qualitatively judging what parameters might be 
appropriate when lumping a two-characteristic model. It is 
convenient to abandon the stochastic interpretation of the 
model, and adopt a deterministic fractional flow viewpoint 
of the process. 

A _ _ _ 

Recall equation (1), P^ = [Q^l, P^l, A^l] , a w^ x 3 
matrix. In this section we assume that the lumpability 

A 

condition is not met, i.e., the rows of P. are not identical. 

1 

We are interested in the following fractions for each grade 
i=l, • • • , n : 

1) the fraction of those in grade i at t who remain 
in grade i for one more period, 

2) the fraction of those in grade i at t who get promoted 
to grade i+1 during period t+1, 

3) the fraction of those in grade i at t who leave the 
system during period t+1. 

The values of these fractions depend on the distribution of 
the stocks in grade i at time t. 

For any stock vector s^(t) ^ 0, we define, 

. s . (t) 

s.(t) = — , a 1 x w. distribution vector. (2) 

1 s i (t)l 1 

Then , 



Define the following functions which map distribution vectors 
into scalars: 



’. (t)P. = f 

l i ^ 



s i (t)Q i l 

s i (t)I 



s i (t)P i l 

s.(t)I 



fjwy 

s i (t)l 



a 1 x 3 vector. 
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q i (s i (t)) = s i (t)Q i l f 
p^s^t) ) = s i (t)P i l, 
a i (s' i (t)) = s i (t)A i I. 

Of 

If the vector of stocks in grade i at time t is s^(t) / 0 

Of 

and s^(t) is defined by equation (2), then, 

% 

q.(s^(t)) = the fraction of those in grade i at t who 

remain in grade i for one more period, 

Pi( s i(t)) = the fraction of those in grade i at t who 

get promoted to grade i+1 during period t+1, 

a i( s i(t)) = the fraction of those in grade i at t who 

leave the system during period t+1. 

We note that, 

q i (s i (t) )+p i (s i (t) )+a i (s i (t) ) = 1. 

Define the following function which maps a 1 x w^ 

distribution vector into a 1 x 3 distribution vector: 

^ % o- * 

(t) ) = s i (t)P i 

= (q i (s i (t)), p ± (s i (t)), a ± (s i (t)). (3) 

. . . 'll 

If the stock distribution vectors for grade i, s^(t), were 
known for t=0 , 1 , 2 , • • * , then one could combine the states in 

, _ ^ 'll 00 

grade i and use the sequence <p^(s^(t)> to form a non- 

stationary fractional flow model. 

The more pertinent situation is that in which s^(0) is 
known, but we do not know or do not want to compute s^(t) 

'll , 

and s^(t) for t> 1. Then in the absence of information about 
future stock vectors we want to choose the parameters of a 
stationary lumped process in some reasonable fashion. That 
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is, for each grade i=l,***,n we want to select three numbers 
(which we will denote q^, p^ and a^) to represent the 
fractions of the people in grade i who stay in grade i, get 
promoted and leave the system respectively each period. The 
lumped process would have a state for each grade and one state 
for out of the system. The transition matrix would be, 




Under the assumption that the lumpability condition is 

A 

not met, there is no choice of parameters for P that will 
in general be equivalent to the unlumped process. We may 

/s 

begin, however, by eliminating choices for p that are 
"obviously bad." 

% 

For any distribution vector s^(t), the 1x3 vector 

^ <\j ~ 

p^(s^(t)) is a convex combination of the rows of p^. Let FL 
be the set of all such vectors, i.e., 

H. = {h.eE^ : h. = s.P., s. 0, s.I = 1}. 

l l l l i' l ' l 

In choosing parameters for the lumped process, any choice 
(q^, p^, a^) which is not in FL is "obviously bad" because 
there is no stock vector for which this choice reflects the 
behavior of the system. 
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The set is a subset of the fundamental simplex in 
3-space. 

Clearly (from Corollary 2) the Markov chain of a two- 
characteristic model is lumpable if and only if for each 
grade i=l, is a single point. A plot of gives 
some qualitative indication of just how unlumpable grade i 
actually is. 

. . A t\j 

Next we consider the behavior of the sequence <P^ (s^ (t) ) > 
under various assumptions. 

1 . Constant Stocks 

If the stock vector does not change with time, i.e., 
s^(t) = s^, t=0,l,2,***, then the stock distribution vector 
is constant, 



■V ... s i _ % 

s i (t) = — f = s i ' 
s . 1 
1 



and the logical choice of parameters for the lumped process is 

(q i' Pi' a i> = Pi (a i> 



s . 
1 



s.l X 



p- ' 



1 

because these parameters exactly reflect the deterministic 
flow behavior of the system. 

2 . Convergent Stock Distribution Vector 

If the sequence of stock vectors, <s^(t)>, has the 
property that the corresponding sequence of stock distribution 

0, ... 'b 

vectors, <s^(t)>, converges to a distribution vector s^, then 
the sequence of linear transformations of s^(t), <p^(^(t))>, 
converges to P^(sL). In this case it may be reasonable to 



/ 
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choose the parameters of the lumped process by taking a 
convex combination of P^(s^(0)) and P^(s^). For example, 

(q^ Pi , a.) = (l-6)P i (s i (0)) + 0p A (s A ) , 0 < 6 < 1, 

where small values of 0 are appropriate for short-range 
planning and values of 0 close to 1 are more appropriate for 
long-range planning. 

3 . Constant External Flows 

It was shown in Chapter III that under constant 
external flows, i.e., 

f^(t) = f^, a 1 x w^ nonnegative vector, 
the- limiting value of the stock vector is 



= ( 



l 

l 

k=l 



‘k 



B ki )N i 



We define 



ii fk Bki ' 



(4) 



a 1 x w^ vector of inputs into each state in grade i in 
steady state. We assume g^ ^ 0. Then, 



= g • N . / (g . N . 1) 
i 3 i i 

g . g . 

= N )/(-±z N.l , 

g . 1 g . 1 

3 i h 

f\, 

so with respect to s^ we may without loss of generally 
assume that g^ is a distribution vector. 

Let 



(5) 



lit = {h^eE^: hu = s^P^, sL calculated from (4) and (5)}. 
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That is, FL is the set of limit points of the sequence 

/\ r\, 

cpi ( s i (t) ) > under constant external flows. We show that 
(except for a special case) the set is a proper subset 
of H i . 



Let, 



N. . = j 
ID 



.th 



row of N. , a 1 x w. vector, 
1 1 



ID 



j*"* 1 element of g^, a scalar. 



1, a scalar. (See equation (4) of 



Chapter II.) From Chapter II we have that is the 



average time spent in grade i starting from state 
(i, j+l(i) - 1) . Note that t. . > 1. Let D. be a w. x w 
matrix having {x^; j=l,***,w^} on its main diagonal and 



zeroes elsewhere. Then D. 



-1 



is defined. 



For any 1 x w. nonnegative vector g^, let 



g i = 



g^i 1 



7 (g ii T ii'*** 



/ 9-; w T i w ) , a 1 x w ± vector. (6) 
' i ' i 



Then it can be shown that g^ is a distribution vector, and, 
s. = g . N . / (g . N . I) 

1 3 / 1 



* -1 

= g. D.N. . 
^ii i 



(7) 



Let, 



S(w.) = (all 1 x w. distribution vectors), 
i l 



S(w^) = {all 1 x w^ distribution vectors 
determined by (7) }". 



% 



Clearly, S(w^) is a subset of S(w^); we now establish condi- 
tions under which it is a proper subset. 
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Equation (6) maps S (w^) into S(w^), and it can be 

shown that the mapping is onto. That is, for every in 

S(w^) there is a g^ in S(w^) such that equation (6) maps 
* 

g^ to g^. Thus, equation (7) may be viewed as a mapping 
from S(w^) into S(w^). We show, however, that the mapping 
in equation (7) may not be onto; i.e., there may exist 
distribution vectors in S(w^) which are not limiting stock 
distributions for any choice of constant external flows 
(recall that (4) gives g^ as a function of the constant 
external flows) . 

The w^ x w^ matrix (D^ is a non-singular linear 

'V, 

transformation from S(w^) into S(w^). It is well-known (see, 
for example, Halmos, 1956) that for any non-singular linear 
transformation T having determinant d and any measurable set 
E, with Lebesque measure m(E), m(E)/m(T ^E) = |d|. In the 
present context this can be written. 



m (S . (w. ) ) , 

/c r t = I det (D . N . ) I . 
m (S i (w. ) ) 1 'll 71 

But the matrix (D^ ISF) is nonnegative and its rows 
all sum to one (i.e., it is a stochastic matrix). So its 
eigenvalues are bounded in absolute value by +1. The deter- 
minant of a matrix equals the product of its eigenvalues, so, 

m (S . (w. ) ) , 

mlS^wpi - l det(D I "i'll 1 - 

Furthermore, m(S^(w^)) = m(S^(w^)) if and only if the magnitude 
of every eigenvalue of (D^ N^) is 1. It can be shown (see 
Karlin, theorem 2.1, page 97), that this can only occur when 
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the non-zero elements of are restricted to the main 
diagonal. This would mean that a person entering grade i 
in state (i,j) could not make a transition to any other state 
in grade i. 

~ 1 ^ 

If |det(D^ N^)|<1, then S^(w^) is a proper subset of 

S. (w.)* The sets H. and H. are linear transformations of 
11 1 l 

^ — 1 ' 

S(w^) and S(w^) respectively, so |det(D^ N^) | <1 implies that 

KL is a proper subset of H^. 

When the set of states in grade i has the 0-1 visiting 
property we may assume without loss of generality that ISb is 
upper triangular with ones on its main diagonal (see Chapter 
II, Section C) . Then the determinant of is +1. The 

determinant of the diagonal matrix is, 

w . 

i 

det D . = II t . . . 

1 j = l ^ 



But r . . > 1 , so det D . = 1 if and only if all t . . 1 s equal 1 . 

i] — l i} 

When the 0-1 visiting property holds this requires that 
be a zero matrix. Thus, if the 0-1 visiting property holds 
and is not a zero matrix, then 

m (S . (w. ) ) l . 

f . A— = ( n t . • ) <1. 

m(S i (w i ) j =1 ij 

Furthermore any change in that increases the average time 

'V 

spent in grade i, {r^j}, will decrease the size of S^(w^) 
and thus decrease the size of H. . 



In summary, it has been shown that under constant 
external flows the vector of the fractions retained in grade. 
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promoted and lost from the system converges to some point in 
H? which is in most cases of interest a proper subset of H^. 
Thus, when the lumped process is to be used for long-range 
projections, choosing the parameter vector (q^, p^, a^) from 
would seem quite reasonable. 

4 . Linear Growth of External Flows 

The case of linear growth of the external flows leads 
to results similar to those in the case of constant external 
flows. It was shown in Section C of Chapter III that under 
linear growth of external flows, f^(t) = tf^ , the stock 
vector in grade i is asymptotically linear and 
s ± (t) = tL ± + C i 

where 



L i = < f i + L i-i p i-i> N i' 

c i - + • 

The limiting stock distribution vector is 

£ im ^ £ im . t 

s.(t) = , s.(t)/s.(t)l 

t^ 00 i t->°° i x 

- ,tL i + + Ci * 1 

- vv • 



But, 



- = l 

l . L 



B 



k=l 



ki 



N i 



So the limiting value of s^(t) is the same in the case of 
linear-growth external flows as it was in the case of 
constant external flows. Consequently the set of all possible 
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is the same under 



Jc 

limit points for the flow fractions, , 
both constant and linear-growth external flows. 

C . AN EXAMPLE 

In this section we use an example to illustrate how the 
planner might use the results of Section B to combine states 
in a two-characteristic model. 

Let, 
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.1 


0 


0 


0 


.4 


l 


0 


0 
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0 
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0 
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m 






m 




m a 



Then , ] 

P ± = [Q^, P.jl, A i l] 

m • 

.2 .7 .1 

.5 .4 .1 

.4 .1 .5 

0 .1 .9 

J 

A 

Clearly, the rows of are not identical, so the condition 

A 

for lumpability is not satisfied. The rows of P^ are the 
extreme points of JL , ahd is plotted in solid lines in 
Figure 2. Any choice of parameters for the combined process 
should be taken from this set JL. 

Let the initial stock vector be, 
s i (0) = (20, 70, 10, 0). 
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Figure 2. Plot of Points and Sets Used in 
in Combining States. 



\ 
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I 



Then, 



s i (0) = s i (0)/s i (0)I 

= (0.2, 0.7, 0.1, 0), 



and, 

s £ ( 0 ) P i = (0.43, 0.43, 0:14). 
That is , 



q ± (s ± (0) ) = 0.43, 

P L (s i (0) ) = 0.43, 
a i (s.(0)) = 0.14. 

The point (0.43, 0.43, 0.14) is denoted by a star in Figure 2. 
For a short-range projection (one or two periods) this point 
would be a good choice for the grade i parameters of the 
combined process. 

Next let us consider what choices of parameters would be 

appropriate for long-range planning under constant external 

flows. From equation (7) of Section B.2 we know that the 

distribution vector for the equilibrium stocks in grade i, 

a. 

S., must satisfy. 




where g^ is a distribution vector. In the present 



example , 



N. = 

l 



1 

0 

0 

0 



0.2 

1 

0 

0 



0.1 

0.5 

1 

0 



0. 04 
0.2 
0.4 
1 



and the diagonal elements of are 1.34, 1.7, 1.4 and 1. 
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Thus , 



d: 1 n a = 



150 


.075 


. 030 


590 


.295 


.115 


0 


.715 


. 285 


0 


0 


1 



.745 
0 
0 
0 

In equilibrium under constant external flows the fractions 
remaining in grade, promoted, and leaving the system are 






s . P . , 
1 1 ' 


and 


from the 


above we 


A 

P. = g* 
l ^i 


dT 1 

i 


A 

N . P . 
1 1 






.755 


. 150 . 


075 .030 


•k 


0 


. 590 . 


295 .115 


= Si 


0 


0 


715 .285 




0 


0 


0 1 




.255 


.590 


.155 


* 


.403 


.288 


.309 


= Si 


. 286 


. 100 


.714 




0 


. 100 


. 900 








* 



.2 

.5 

.4 

0 



.7 
.4 
.1 
. 1 



. 1 
.1 
.5 
.9 



* 

Because g^ can be any lxw. distribution vector, we see that 

the parameters of the combined process in equilibrium under 

constant external flows, s. P. , must be a convex combination 

i i ' 

of the rows of the above matrix. The set of all such convex 

. . * 
combinations has been previously designated , and it is 

plotted with dashed lines in Figure 2. For long-range 

planning the planner should restrict his choice of parameters 

* 

for the combined process to this set H. . 
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Next let us assume that the planner estimates that in 
equilibrium the vector of the numbers entering the states 
in grade i would be approximately, 
g ± = (30, 50, 20, 0) . 

Then the equilibrium stocks would be 
s . = g . N . 

l l 

= (30, 60, 48, 19.2) , 
s ± = (.191, .382, . 305, . 122) , 

and , 

s. T> i = (.351, .329, .320) . 

The latter point is denoted by a triangle in Figure 2. In 
this case a convex combination of the points denoted by * and 
A in Figure 2 would seem a reasonable choice of the parameters 
of the combined process, i.e., 

( qj/ P± , a..) = 0 (.351, .329, .320) + (1.0) (.43, .43, .14), 
where 0 < 0 < 1. 

Small values of 0 are used for short-range planning, and larger 
values of 0 are used for long-range planning. 
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VII. AN APPLICATION TO RETRAINING PROBLEMS 



A. INTRODUCTION 

Consider an organization with people trained in various 
skills. Each person with a given skill belongs to a "skill 
group," which we will call "group" for simplicity. In this 
chapter we assume that the second characteristic in the state 
description is the group to which an individual belongs. We 
also assume that a person cannot belong to more than one group 
at a time. 

Group membership may be quite explicit as in the case of 
the U.S. Marine Corps system of occupational fields or it can 
be implicit as in the case where group membership is determined 
by the number of years of formal education completed. In any 
case group membership defines a partition on the organization. 

In many organizations retention and promotion vary con- 
siderably from group to group. Because of this it is often 
the case that people have to be retrained between groups in 
order to prevent surpluses and deficiencies of people in 
various skills. This is particularly true in the military 
enlisted personnel system. These retraining problems are the 
s.ubject of this chapter. 

We assume that there is a nonnegative cost associated 
with the retraining of a person from one group to another. 

The meaning and the numerical value of a retraining cost is 
left to the planner. For example, a planner may express his 
opinion that people in group k are unsuited for retraining 
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into group m by assigning a prohibitively high cost to such 
retraining. 

Unlike the LOS or TIG models, in the retraining model we 
are quite interested in how many people change their second 
characteristic each period. We assume that assumptions AO, 

A1 and A2 of Chapter II still hold. 

We are interested in long-range planning, so we begin 
with an equilibrium model, i.e., a model in which we assume 
that stocks and flows do not vary from one period to the 
next. The equilibrium model is very useful in determining 
achievable goals for an organization. Once determined, these 
goals can be used to judge short-range manpower policies. 

For example, use of the equilibrium model might show a long- 
range requirement for retraining into the sonarman group, 
and there might be a present surplus of sonarmen. The goals 
from the equilibrium model would cause us in this case to 
question a short-range recommendation to convert sonarmen 
training facilities to some other use. 

In a retraining problem the planner would ideally like 
to specify the stocks and promotion rates for each grade/ 
group combination (state) and then minimize retraining costs. 
Unfortunately, this is not generally possible, and the planner 
must compromise or trade off between desired stocks, desired 
promotion rates and minimal retraining costs. A major 
purpose of our development of the retraining model will be 

i 

to show the close interaction between stocks, promotion rates 
and retraining costs. Consequently, we take a descriptive 
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rather than a prescriptive viewpoint of the retraining 
problem. The prescriptive approach would require the planner 
to specify a mathematical description of his preferences with 
respect to tradeoffs between stocks, promotion rates and 
retraining costs. Such a requirement is usually quite 
impracticable . 

In the following development of the retraining model the 
attrition rates are treated as known and fixed. This may not 
be entirely acceptable. Retraining can be used as a method 
of reducing attrition. It can be used as an inducement to 
people to remain in the organization. This problem has not 
been treated here. 

One purpose of the mathematical analysis of the retrain- 
ing problem is to form the basis for an interactive computer 
program. Such a program would assist the planner in finding 
acceptable and feasible combinations of stocks, promotion 
rates and retraining costs. A device to be used in the 
interactive program, which is used in the mathematical 
analysis below, is proportionate control of stocks and 
promotion rates. For example, for a given grade the planner 
is required to specify the proportionate promotion rates for 
the various groups in that grade. This device may appear 
artificial; however it has great practical advantages. The 
U.S. Department of Defense is considering a promotion policy 
in which promotion rates within any grade must be equal for 
all groups. It is not our intention to join the debate over 
the efficacy of such a policy, but our results are quite 
useful in investigating the implications of such a policy. 



103 



B. DEFINITIONS AND THE BALANCE EQUATION 



The organization is partitioned into, states according to 
grade and group. It is assumed that transitions between 
states can be modeled by stationary fractional flows. The 
organization is assumed to be in equilibrium; stocks and 
flows do not change from one period to the next. One-period 
transitions from any grade are restricted to the same grade, 
the next higher grade or out of the system (see assumption Al) 
There are K groups indexed by k=l,*»*,K and n grades 
indexed by i=l,***,n. State (i,k) corresponds to grade i 
and group k. We define 



s . i 
lk 


= number of people in grade i and group k. 


s . 
1 


= * * * ' s iK.) ' a 1 x K vector. 


f ik 


= number of people who enter the system in state 
(i,k) each period, 


f i 


= ( f ii' ' * ' ' f i K ) ' a 1 x K vector, 


a ik 


= fraction of those in state (i,k) at the end of one 
period who leave the system during the next period. 


a . 

l 


= [a. ,,•••, a..,], a K x 1 vector, 
ll iK 


^ik 


= fraction of those in state (i,k) at the end of one 
period who are still in grade i (in any group) at 
the end of the next period. 


p ik 


> 

= fraction of those in state (i,k) at the end of one 
period who are promoted to grade i+1 during the 
next period. 


p i 


= [?ii' * * ‘ 'PfK 1 ' a K x 1 vector. 
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From assumption A1 we have 

9ik + Pik + a ik = ^ 

Nonnegative proportionality constants for the promotion 
rates, {ou^.: i=l, • • • ,n, k=l,***,K}, are specified so that in 

each grade i, 

p ik = p i a ik ' k=l , • • • , K, (1) 

where p? is the maximum promotion rate over all groups in 
grade i. 

Let, 

a. = [a ..,*•• ,a .] , a K x 1 vector with maximum element 
1 i _L IK 

equal to 1. Thus, 

p. = p*a. , (2) 

and v/e see that whatever the promotion rates for the various 
groups in grade i may be, they will be proportioned according 
to the components of cu. 

We begin by considering all K groups in some grade i. 

The number of people entering grade i during any period is: 

K K 

No. into grade i = £ f,. + I s. , .p. , . 

k=l lK k=l 1 i ' K 1 1,K 



= f ± l + s i_i p i_i • {3a) 

The number of people leaving grade during any period is: 

K K 

No. out of grade i = £ s i k a i k + l s i k Pi k 

k=l k=l 

= s . a . + s . p . 

1 x 1*1 

= s i (a i + p i ) . (3b) 
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Under the assumption that the system is in equilibrium, the 
numbers entering and leaving a grade each period must be 
equal. Thus 




(4) 



The above equation will be referred to as the balance 
equation for grade i. 

C. PARAMETRIC CALCULATIONS FOR PROMOTION RATES 

For specified stocks we use the proportionality constants 
for the promotion rates to solve for the promotion rates one 
grade at a time starting with grade 1. 

f^l ■■= s 1 (a 1 = p x ) 




* 

(Recall the pVs are scalars.) 
Thus , 




and 




For grades i=2 



n: 



f.l+s. . p . . =s.(a. +p.) 

1 1-1*1-! l l *i 




Thus 



* 




(5) 
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and, 

* 

p . = p . a . . 

Note that when the stocks are specified we can solve for 
all the promotion rates without explicitly considering the 
retraining flows between groups. 

We note that: 

1) f^I is the number of people who enter grade i each 
period from outside the system, 

2) s i-iPi-i t ^ ie nuir ‘k er of people promoted into grade i 
(from grade i-1) each period, 

3) s £ a i is th e number of people who leave the system from 
grade i each period. Thus p* > 0 if and only if the total 
number entering grade i each period is no smaller than the 
number leaving the system from grade i each period. 

Feasible promotion rates must satisfy 0£p^£l - a^ • 
The values of p^ computed above from the balance equations 
may not satisfy these constraints. It may also be the case 
that the promotion rates computed above are unacceptable for 
practical reasons. We next consider how one may trade off 
between stocks and promotion rates to obtain a satisfactory 
feasible set of rates. 



D.. PARAMETRIC CALCULATIONS FOR STOCKS 

Let us suppose that the specified stocks yield infeasible 
promotion rates in grade i, e.g., p* < 0. We might then 
approach the problem from the opposite direction in grade 
i by specifying the promotion rates and solving the balance 
equations for the stocks. The difficulty is that the solution 
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to such a problem is not unique. We will discuss two 
parametric methods that lead to unique solutions for the 
stocks in grade i when the values of the promotion rates 
in grade i are specified. 

In the context of interactive programming the parametric 
calculation of stocks is used to make adjustments when the 
initially specified stocks have led to infeasible or unaccept- 
able promotion rates. In Method 1 a specified proportionality 
between stocks in the various groups is maintained. In Method 
2 we maintain a specified proportionality between the 
deviations of the stocks from their desired levels. 

1. Method 1 



constants for the stocks in the various groups in grade i, 



In this method we specify nonnegative proportionality 



K 

{Y ik :k=1 ' ' ” ' K ^ where l Y ik 



k=l 



1. Then 




( 6 ) 



o* i s the total number of people in grade i. 




Yi = (Y n' ' ’ * 'Y iK ) ' a 1 x K vector. 



Then 




(7) 



Let the promotion rate vector for grade i, p^, be specified. 
The balance equation (4) for grade i is 



Thus , 



* _ f i X + S i-l p i-l 
S i Y.(a. + p.) 



( 8 ) 



and. 



s . . = s . Y . , 

lk 1 lk 



To interpret these results note that, 

+ Pi ) 

Y i (a i + Pi> ■ 7 



= s i (a i + Pj> 

s . I 
1 

= fraction of those in grade i who 
leave grade i (by attrition or 
promotion) each period. 



2. Method 2 



In this method we specify proportionality constants 
for the changes in the stocks in grade i, {6^; k=l,***,K}. 

We suppose that the desired stocks in grade i, denoted by 
{ s ik? h=l,***,K} lead to infeasible or unacceptable promotion 
rates, and require that the stocks satisfy not only the 
balance equation but also the parametric constraints, 

s ik = (1 + c i 6 ik )s ik 
where c^ is a scalar to be determined. 



Let, 




6. = 
l 


S iK>' a 


A 

s . = 
1 





and define an operator 0 such that E m x E m -*• F. m and 

6 i Ps i = ^il^il • ' ' ' ' <S iK^iK ) ' (See A PP endix A -'> 
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We then have 



s i = E i + c i h es i • 

Using Method 2 parameterization the balance equation 
(4) for grade i is 

V + s i-iPi-i - s i (a i + Pi> 

= (s. + c. 6.@s.)(a. + p.) . 

1 1111 *1 



Thus , 



°i = 



f.l + s. ,p. , - s.(a. + p.) 

l i-l*i-l l l *i 

(6 i 0 s.^ (a i + p i ) 



and, 



s ik (1 + c i 6 ik ) s ik * 



One advantage of Method 2 is that the stock in some 

A 

group k can be held constant at the desired level s^ by 
assigning 6^ a value of zero. 

A significant difference between the two method of 
parameterizing the stocks is that Method 1 always yields 
nonnegative stocks while Method 2 may not. In the initial 
stages of an investigation of retraining policies, Method 1 
would seem more practical. After feasible and not-too- 
unacceptable policies have been derived, then Method 2 could 
be used to advantage. Only Method 1 will be used to para- 
meterize stocks in subsequent sections. 



E. RETRAINING FLOWS 

Once we have arrived at a set of stocks and promotion 
rates that are acceptable, we may compute the required 
retraining flows. Let, 
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= number of people retrained out of (>0) or into 
(<0) state (i,k) each period 



r i " (r n' 



,r^ K ) , a 1 x K vector. 



We derive an expression for the retraining flow r^ by 
considering the numbers of people that enter and leave 
state (i,k) each period in the absence of retraining: 

No. into state (i,k) = f ik + S i-l,kPi-l,k 
No. out of state (i,k) = s ^j c ( a ij c + P^) • 

Under the assumption that the system is in equilibrium, the 
flows into and out of state (i,k) must be equal; we use the 
retraining flow r^ to bring about this equality: 

r ik = (f ik + s i-i,kPi-i,k>- s ik (a ik + p ik> • 

The balance equation (4) ensures that 
K 

I r ik = 0, 



(9) 



k=l 



i.e., everyone retrained out of one group is retrained into 
another group. 



F. THE RETRAINING TRANSPORTATION PROBLEM 

We assume there is a known cost c^(k,m) for retraining a 
person in grade i from group k to group m. We assume that 
all costs, including c^(k,k), are strictly positive. 

For any set of stocks and promotion rates in grade i that 
satisfy the balance equation, we may compute the retraining 
flows, {r^}, and treat these as supplies of people for 
retraining (r^j, 0) and demands for retrained people (r^ < 0) . 

The balance equation ensures that total supply equals total 
demand . 



Let, 



x^(k,m) = number of people retrained from state (i,k) 
to state (i,m) each period. 

In order to match supplies and demands at minimal cost, we 
must solve the following linear program. 

K K 

Pi] min l l c. (k,m) x • (k, m) 
h=l m=l 1 

K 

ST l (x i (k,m) - x i (m,k)) = r ik ; k=l,---,K 
m=l 

x^(k,m) 0; k=l,***,K m=l,***,K. 

The equality constraints in Pi ensure that for each state 
(i,k) the net number of people retrained out of or into state 
(i,k) matches the supply of or demand for retrained people 
for that state. 

We note that, 

K K 

l (x. (k,m) - x . (m, k) ) = l (x. (k,m) - x. (m,k)) 

m=l 1 1 m=l 1 1 

m^k 

is an identity. That is, x^(k,k) does not appear in the 
equality constraints of PI. The nonnegativity constraints 
do require x^(k,k) _> 0. Because we assume c^(k,k) > 0, the 
optimal solution to Pi will always have x^(k,k) = 0. 

The dual of Pi is denoted Dl: 



K 



max 


L 1 
k=l 


ik v ik 






ST 


v ik 


- v im - 


II 

♦ 

• 

♦ 


m=l , • • • , K . 




V ik 


UNRESTRICTED, k= 


a, • • • ,k. 
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It will be convenient to define the following vectors: 



X i = 



[x i (l,l),-*-,x i (l,K), 

2 

a K x 1 vector, 



x i (2,l), 



>x ± (K,l) , 



,x i (K,K)], 



v i = t v ii' * * * ' v ik ] ' a K x 1 vector. 

The linear program PI is quite similar to the classical 
transportation problem. It becomes a transportation problem 
if we make the following assumption. 

Assumption A3. For any h,j,m e{l,***,K}, 



c i (k,m) < c i (k,j) + c i (j,m). 

The practical implication of this assumption is that it 
must be cheaper to retrain a person from group k to group m 
than it is to retrain one person from group k to group j 
and retrain another person from group j to group m. 

The theoretical implication of this assumption is that 
the optimal values of some readily identified variables in 
PI must be zero. 

Let, 



K* = (k: r ik > 0}, 



kT = {k: r ik < 0}. 



Consider the case in which j eK^ , the optimal solution 

to Dl is v? , and the optimal solution to PI is x? . 

Suppose that x^(k,j) >0. By complementary slackness it 
must then be true that, 



v 



0 

ik 




c i (k, j ) . 



( 10 ) 
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But jeK^ , so there must be some m such that xY(j,m) > 0. 

This implies by complementary slackness that, 

o o , . . 

v . . - v . = c . (n ,m) . 

13 1m 1 J 

Adding (10) and (11) and using assumption A3, we have 
o o 



( 11 ) 



V ik v im 



c i (k,j) + c i (j,m) > c i (k,m). 



implying that the optimal solution to D1 , vV, is not feas- 
ible, a contradiction. Consequently, it cannot be true 
that x?(k,j) > 0 when jeK*. By a similar argument, we can 
show that it cannot be true that x?(j,m) > 0 when jeK^ . 



solution, if x?(k,m) > 0, then keK + , meK. 

' 1 ' 1 ' 1 



Thus assumption A3 implies that in the optimal primal 

Removing the 

variables which must be zero in any optimal solution from 
PI leads to the primal and dual equivalents of PI and D1 
under assumption A3: 

PRTP) min 



l + l 

keK. meK. 
1 1 



c i (k,m) x i (k,m) 



ST 



l _ x- (k,m) = r.. , kekt 
meK. 

T_ 

l + “ x i (k,m) = r 



keK i 



K 



. , meK. 

1m 1 



x ^ ( k , m ) 0 . 



DRTP) 



max 


L r 




11 1 


ST 


v ik 




v ik 



£ r . , v . , 
L lk lk 



r ,. < c.(k,m); keK., meK. 

1 m — 1 11 



UNRESTRICTED. 



114 



The abbreviations PRTP and DRTP denote the primal and dual 
retraining transportation problems respectively. The 
programs PRTP and DRTP are in the form of the classical 
transportation problem. 

We should pause to note some of the assumptions inherent 
in modeling retraining costs in this rather simple manner. 

1) There is no provision for set up costs. 

2) Each grade is treated as a separate problem. 

3) Marginal retraining costs are assumed constant with 
respect to the number retrained. 

The computational tractability of the simple transportation 
model is no small consideration, and one suspects that the 
model is adequate for the purposes at hand, viz., long-range 
planning . 

Let the optimal value and the optimal solution to DRTP 

, O _ _ j . O , O ° 



be denoted d^ and v^ = [v^ , • • • , v^] respectively. Then, 



K 



i° = l r. 
i , , i 



k=l 



ik v °k • 



( 12 ) 



We note that the optimal solution to DRTP is unique only to 
an additive constant. 



G. RELATION OF RETRAINING COSTS TO STOCKS AND PROMOTION RATES 
In this section we assume that the stocks and promotion 
rates in .grade i-1 have been determined, and we consider how 
our choice of the stocks and promotion rates in grade i will 
affect the optimal retraining costs in that grade. 

We will parameterize both the stocks and the promotion 
rates in grade i using Method 1 for the parametric calculation 
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of the stocks. Thus, we must specify two sets of propor- 
tionality constants (see equations (2) and (7)): 

, K} for the promotion rates, 



1) (“ik' k=1 ' 



2) {y ik' k=1 ' 



/ K} 



for the stocks. 



We consider the case in which the total number of people 
entering grade i each period is known and fixed: 



f . + s 



@ = 9^/ a known 1 x K vector 



f^l + s^_^p^_^ = g^i/ a scalar. 

The stocks and promotion rates for the various groups in 

* * 

grade i are determined by the choice of s^ and p^ : 
s ik = s i Y ik - 
Pik = ?! a ik • 

The balance equation (’ 4 ) places a constraint on the choice 

X * . 

of s and . We consider as the independent variable, 

so the balance equation, 

g.I = s.(a. +p.) 

ii *1 

* * 

= s . y . (a . + p . a . ) , 

ill *i l ' 

* 

leads to the relation of the dependent variable p^ to the 



independent variable s^ : 



gi I - s ± y i a i 



(13) 



s . Y • a • 

l ii 



This relation is sketched on the following page. 
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Figure 2. 



Sketch of s*. and 
• 1 



* 




The curve is asymptotic to the lines s^ = 0 and 

p* = -y . a . /y . a . . 

1 1 a/ 1 1 l 



We must have p. > 0, which by (13) is equivalent to 



l — 



it ™ 

s i^i a i — ^i^' i' e *' the total number leaving the system from 
grade i each period must be no greater than the total number 
entering grade i each period. It is only when the above 
inequality is strict that we may (indeed must) promote people 
out of grade i. 
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* * 

Let us suppose that the couple (s^, p^) satisfies (13). 
We then use the retraining flow equation, 



ik 



= 9 



ik 



- S I Y ik (a ik + ?! a ik>' 



(9) 



to determine the RHS of the retraining transportation problem. 

'k "k k 

The value of r^ k depends on s^ and p^ . Because p^ is 
uniquely determined by s? from (13) , we may consider r^ k a 
function of s? . Substituting the expression for p* from 
(13) into the equation for r^ we have, 



g.l - s.y.a. 

r., (s.) = g., - s.y., (a., + ( 

ik i y ik l ik ik * 

s.y.a- 

i'i l 



’“ik 1 



After some manipulation this leads to, 

, *. _ r , ^ik _ ^ik a ik ' * , Y ik a ik 

r ik s i ^i^ t y.a. s i Y i a i^y.a. 

g^l 'i l 'i l 



Y ik a ik 

Yi a i 



) 



(14) 



We assume the vector of group-to-group retraining costs, 
c^, is fixed and known, so the optimal value of the retrain- 
ing transportation problem is a function of the vector of 
retraining flows, r^. But we have shown that for fixed flow 
into grade i, g^, the retraining flows are a function of s* , 
so the optimal value of the retraining transportation problem 
is a function of s? . We show this by rewriting the primal 
and dual of the retraining transportation problem. 

PRTP] min <Jk = £ £ 



c ^ (k,m) x^ (k,m) 



keK . meK. 
i l 



ST 



l x i (k,m) = r. k (s*), kekt 

meK . 
i 

r * - 
L x i (k,m) = - r im ( s i)/ mcK i 

keK + 

l 



x ^ ( k , m ) 0 . 
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K 



DRTP) n,ax d i (.s*) = Z r (s*)v 

k=l 



ST v ik ” v im - C i ; keK i / 



V ik' v im UNRESTRICTED • 

The question that we address in this section is how the 
optimal retraining costs vary with respect to the total 
stocks s? . 

For some specified values of s* let d?(s*) be the optimal 

value of the retraining costs and let v?(s*) be the optimal 
solution to the dual retraining transportation problem. 

We then have, 



3dJ(s*) 

8s* 

1 



K 



s. k=l 
1 

K 



£ r (s*)vv, (s. ) 
ik‘ i' lk l 






= Z . 

k=i 1 as. 



+ r (s. ) 
lk i 






* 

3s. 

i 



From (14) we have immediately, 



ar ik (s i> 

* 

9s i 



, Y ik a ik Y ik a ik. 

= Y^a. -77—^ — ) . 



ii y . a . 

1 i i 



y . a . 

1 1 l 



The question then is, how does the dual optimal solution 

v°(s*) vary with st. We note in the dual that varying s* 

does not alter the feasibility of v?(s*). We note in the 

* 

primal that varying s^ varies the RHS of the program, so the 

* 

primal optimal solution must vary with s^. But suppose that 
the primal basis that was optimal at s^ is still feasible at 
s* + e for |e| sufficiently small. Then this primal basis 
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is feasible, it satisfies the complementary slackness 
condition with respect to v°(s*), and v?(s?) is feasible in 
the dual, so v?(s?) is still the dual optimal solution. In 

summary, the dual optimal solution, v?(s*), does not change 

★ ★ 
with s^ so long as the primal basis which was optimal at s^ 

remains feasible. (See Dantzig, 1963.) 

Consequently, if for some e>0 changing the stocks from 

s i to t z does not cause a primal basis change, then. 



_ o , ★ . 
3v . , (s . ) 
lk 1 

* * 

3 s . 
i 



= 0 , 



and 



3d?(s*) 



l ' i' v ° / *\ , Y ik a ik 

—r- = Vi J, v ik (s i> ‘tit- 

3s. k=l 'i l 

l 



Y ik a ik 

Yi a i 



(15) 



We note that: 



1) 



Y . a . 

1 1 l 



s . y . a . 

l 1 1 l 
* 
s . 
i 



s . a . 

l l 



S i 1 



= fraction of those in grade i who leave 
the system each period 



2 ) 



Y • i a . , 
lk lk 

Y • a . 
l l 



s . y • i a . . 

l ' lk lk 
★ 

s . y • a . 
ii l 



s . , a . , 
lk lk 

s . a . 

l l 



= fraction of those leaving system from 



3 ) 



Y • i a • i 
1 lk lk 

Y • a . 

1 1 l 



grade i who leave from state (i,k) 
★ ★ 



s . y • i P • a . , 
i'ik y i ik 

4 - 

s . y . p . a . 

i 1 i*i i 



s ikPik 

s iPi 



= fraction of those promoted from grade 
i who are promoted from state (i,k). 
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1 . Special Case 1 — Costs Constant 



Special case 1 is that in which- for some positive 
constant c, 

°ik = ca ik- k=1 '"'- K - 
In this special case, 

a • — ca . 
i i 



a ik 

Y i a i 



ca ik 

CY i a i 



ik 



Yi a i 



^ik a ik ^ik a ik 



Yi«i 



Y . a . 

' l l 



= 0 , 



r ik (s i> 



T^ik 



g i 1( I ± f 



g ii 



Y ik a ik 

Yi«i 



) , 



8 s* 



= 0 



Thus, when promotion rates are proportional to attrition 
rates, the retraining flows do not depend on the total 
stocks s^. Consequently, the optimal retraining costs do 
not depend on s^ . 

The sensitivity of retraining costs to the total 
stocks depends in some sense on how promotion and attrition 
rates depart from the foregoing special case. 

2 . Special Case 2 — Costs Increase with Stocks 

A second special case is that in which for each 



ke{l, •**,!<}, either, 



or, 



2) > ^ik a ik > 



gjL I " Y i a i 



We note that. 



ik 



a 



Y 



. a 



1 



ik 

i 



y ik 

— 3 = fraction of those entering grade i who enter in 

g . 1 

group k. 

Thus, condition 1) above indicates that the fraction of grade 
i "entrants" entering group k is smaller than the fraction of 
grade i "promotees" promoted from group k which in turn is 
smaller than the fraction of grade i "leavers" leaving from 
group k. From equation (14), condition 1) implies that both 
r ik ( s *) and its partial derivative with respect to s* are non- 
positive. That is, there is a demand for retrained people in 
group k of grade i, and as the total stocks in grade i 
increase so does this demand for retrained people in group 
k. 



Condition 2) is simply condition 1) with the inequalities 
reversed. Condition 2) implies that both r^fs*) and its 
partial derivative with respect to s? are nonnegative. So 
there is an excess of people in group k of grade i, and as 
the total stocks in grade i increase so does this excess of 
people in group k. 

In this special case increasing total stocks increases the 
supplies in the groups that have people available for retrain- 
ing and increases the requirements in the groups that have 
a need for retrained people. Consequently, increasing total 
stocks, st, increases the optimal retraining costs. 
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3 . Special Case 3 -- Costs Decrease with Stocks 

A third special case is that in which for some values 
of st we have for each ke{l, •••,!<}, either, 



1) 



or, 



, ^ n j Y ik a ik ^ Y ik a ik 

r.. (s.) > 0 and < 

ik' i' y.a. - y.a. 



l l 



l l 



. « , ^ik a ik . ^ik a ik 

2) r (s) < OaM — . 

'll 'll 

Condition 1) above indicates that group k has people available 

for retraining, and the fraction of grade i "promotees" promoted 

from group k is smaller than the fraction of grade i "leavers" 

* 

leaving from group k. If the total stocks, s^, are increased 
then the number leaving the system from group k increases in 
direct proportion to s^. However, increasing s^ causes the 

•ft 

overall promotion rate p^ to decrease, so the number promoted 

from group k does not increase as rapidly as s^. Under 

condition 1) we then have a decrease in the number to be 

* 

retrained when s. increases. 

l 

Condition 2) is simply condition 1) with the inequal- 
ities reversed. Under condition 2) there is a requirement for 
retrained people, but this requirement decreases as s? 
increases. 

In this special case increasing total stocks decreases 

the supplies in the groups that have people available for 

retraining and decreases the requirements in the groups that 

have a need for retrained people. Consequently, increasing 

£ 

total stocks, s^, decreases the optimal retraining costs. 
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The rate of change of retraining costs with respect 



to promotion rates may be computed from 



and. 



rv * 

3s . 



9P* 



* 

s . y . a . 
1 1 x 1 



V a i + PiV 



8d° 



8d l 



8s* 



8s* 

l 

. * 
8p, 



H. TWO-GRADE OPTIMIZATION OF RETRAINING COSTS 

In this section we develop a technique for jointly 
varying the stocks and promotion rates in two grades, i and 
i+1, in such a way that the stock and promotion rates in 
grades below grade i and above grade i+1 need not be changed. 
We then show how the retraining costs in grades i and i+1 
respond to these stock and promotion rate changes. 

Recall the balance equations (4) for grades i, i+1, and 

i+2 : 



i) 


fi I + 


S i-l p i-l = 


★ ★ * * * 

= s.y. (a.+p.a. ) = s.(y.a.) + s.p- (y.a. ) 

1 i x, i r i i' 


i+1) 




* * 


* * 


w 


+ SiPiY^i 


■ s i+i Y i+i <a i+i + Pi+A+i* 


i+2) 




★ * 




W 


+ S i + lPi + l 


r i+l“i+l = s i+2 (a i+2 +p i+2 ) ■ 



We consider s^_^ and as fixed vectors. If s^ is varied, 

then p? must also change in order to satisfy i) . Furthermore, 
the product s?p? cannot remain constant as s? is varied. 

This means that satisfying i+1) when st is varied will require 

* * * 
a change in s^ + ^ or or both. Suppose that as s^ is 
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varied we change the value of s^ + j so that i+1) is satisfied 
and the produce s? + ^p? + ^ i s held constant. This is possible 

and the values of s^ + ^ and p^ + ^ are uniquely determined. By 

, * * ..... 
holding s^ + jP^ + ^ constant equation i+2) is satisfied without 

changing stocks in grade i+2.- Thus we have restricted the 

ic 

effects of varying s^ to grades i and i+1; all other grades 
are unaffected. 

A matter of interest is how the retraining costs vary as 

a consequence of changing stocks by the foregoing technique. 

We have shown in the previous section how retraining costs 

in grade i respond to changes in stocks when the input flow 

is fixed. We next consider how retraining costs in grade 

* 

i+1 respond to changes in when the promotion flow from 
* * 



grade i+1, s ^ + ^p^ + ^y^ + ^°c^ + ^ , is held constant. 
We use three equations: 

1) 



2 ) 



3) 



Balance equation 
* * 

. . . . 1 + £ 

1 + 1 1 ‘ 1 ' 1 ’ 1 



f,.,l + s.p.y.a. = s. Al Y 4 Al (a i+1 +p i+1 a i+1 ) 



i+1 ' i+1 
Retraining flow equation 

r i+l,k - ^i+ljk + s i^i^ik a ik 



- s . 



i+l Y i+l,k ^ a i+l , k + ^i+l a i+l , k^ 



Fixed promotion flow constraint 

s * . t P* . t = constant. 

1 + 1 * 1 +! 



(4' ) 



( 9 ’ ) 



Taking the partial derivative of the balance equation with 
★ 

respect to s^ yields, 



8s 



i+1 



8s 



Y i a i 

Y i+l a i+l 
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The partial derivative of total stocks in grade i+1 with 

respect to total stocks in grade i is negative when s*_^p*_^ 

and s £ + iP* + i are held constant. This may be explained in 

terms of the promotion rate in grade i. Decreasing p* 

causes an increase in s? but a decrease in s?p? . This in 

* 

turn causes a decrease in s^ + ^ . Taking the partial deriva- 
tive of the retraining flow equation then leads to. 



8 r i+l,k = v „ . Tn-l.k a i+l , k 

9s* 1 1 *i+l a i+l 



Y • , cx . , 

1 lk lk 

y . a . 

1 1 i 



) . 



We then have, 



3d 



i+1 



3s 



= Y< a . 



I V? 



i i k i x i+l,k Y i+1 a i+1 



^i+l,k a i+l,k _ ^ik a ik j 



y . a . 

l i 



and, 

3 (dV + d 



i+1 



3s 



K y . , a . , 

V r o 'lk ik , , o o 

V ik y^a^ V ik V i+l,k 



y a., 
lk ik 

y .a. 

'i i 



+ v 



^i+l,k a i+l,k 
i+1 ' k Y i+i a i+i 



-] 



(16) 



This equation provides a rather practical means of guiding 

the planner in his search for an acceptable combination of 

stocks, promotion rates and retraining costs. The scalar 

coefficients of the optimal dual variables in (16) are 

readily computed, and they can be stored in a 3 x K matrix. 

* 

For the value of s^ specified by the planner the optimal dual 
variables for the corresponding retraining transportation 
problem are found and (16) is computed using 3K multiplica- 
tions and additions. The sign of the partial derivative in 
(16) indicates to the planner how he might reduce retraining 
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costs. Whether the planner chooses to make the indicated 
change in total stocks to reduce costs will depend on how he 
perceives his constraints on stocks and promotion rates. At 
any rate he may change the stocks in grades i and i+1 so as 
to decrease retraining costs and confine the effects of the 
stock changes to grades i and i+1. 

A special case of some practical interest is that in 
which retraining costs, 

1) are the same in grades i and i+1, i.e., 
c i (k , m) = c i+1 (k,m) , 

and, 



2) depend only on the group retrained into, i.e., 
c i (k,m) = c i (m) . 

In this special case any primal feasible solution to the 

retraining transportation problem is optimal, and an optimal 

solution to the dual is, 

v?. =0 if keKt 

lk l 

= -c . (k) if keK . . 

l l 



One then has, 



5(d ? + d ?+l> 
3s* 



= Yja. I _ c. (k) ( 

1 kcK. 1 
1 



Y ik a i+l Y i+l,k a i+l,k 



y.a. 
i i 



Y i+1 a i+l 



) 



Roughly speaking, the above equation compares attrition rates 
in grades i and i+1 weighted by retraining costs, and indi- 
cates that costs will be reduced by reducing stocks in the 
grade that has the greater weighted attrition rate. 



I. THE RELATION OF TRAINING COSTS TO TOTAL FLOWS 

In Sections G and H we studied the behavior of the optimal 
retraining costs when the total stock in grade i, st, is the 
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independent variable. This approach has definite practical 

* 

appeal because s^ is likely to be of importance to the planner. 
In this section we shall consider the behavior of the optimal 
retraining costs when there are two independent variables: 
the total number of people entering grade i each period and 
the total number of people promoted out of grade i each 
period. The two scalar variables will be referred to as the 
"total flow into" and the "total flow out of" grade i. It 
should be noted, that the "total flow out of" grade i does 
not include the number of people that leave the system from 
grade i each period. 

We begin with an assumption about the external flows. 

Assumption . For grades i=2,***,n the external flows are 

zero vectors, i.e., f^ - 0, i=2,***,n. There is a known 

r 0 

distribution vector f^ such that the external flow for grade 
1 is always a scalar multiple of f^, i.e., f^ always 

f i ^ 

satisfies = f.. . 

V 

The practical implications of this assumption are that 

hiring is restricted to grade 1, and the vector of people 

hired in grade 1 has a known distribution. The theoretical 

implications of this assumption are that the total flow out 

of grade i is the total flow into grade i+1, and the distri- 

r 0 

bution of any total flow is specified by f^ or (Y^Qcl^) /y^cu • 
(The latter point will be explained.) 
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Let us define a 1 x K vector g .= (g ,g .) by 

1 IX IK 



g. = f. 



when i=l 



= when i=2 , • • • , n . 



We may also write g^ as the product of a total flow and a 
distribution vector, 

g^ = (f 1 l) f x / when i=l 



= (s. , p. , y . , a . , ) 

1-1 * 1 - 1 1 1-1 1 - 1 ' 



?£- l 3a i-l 
*i-l “i-1 



when i=2 , • • • , n . 



where, 

f^I = total flow into grade 1, (a scalar) 

•Hr A 1 

= tota l flow into grade i, i=2,***,n, 

(a scalar) , and the vectors and are 

distribution vectors. Thus the vector of the numbers of 
people entering the various groups in grade i, g^, is always 
equal to a scalar times a known distribution vector. 

By our definitions of "total flow into" and "total flow 
out of" any grade we have, 

gl = total flow into grade i, i=l,*-*,n 

= total flow out of grade i-1, i=2,*-*,n. 

Note that the total flow out of grade n must be zero, so we 
define 9 n+ ^ = 0. 

Notation is simplified somewhat by defining, 

1^ = g^l, a scalar, 

CK = g^ + ]l/ a scalar. 

The mnenomic I for "in" and 0 for " < t" is intended. 
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We define the "flow plane" for grade i as the non- 

2 

negative quadrant of E with points having coordinates (1^, 0^) . 
That is, the total flow into grade i, 1^, and the total flow 
out of grade i, 0^, define a point in the flow plane for 
grade i. 

Lemma 1. For any point (1^, CK) in the flow plane for 

"k Vt 

grade i the values of s^ and p^ that satisfy the balance 
equation are unique. In particular. 



l y • a . 

1 1 l 



(17) 



* = / !LL_ 

P i y • a • I • “ 0. 

'll l l 



) = 



°i 



s .y .a . 

i 1 1 i 



(18) 



Proof. Recall that the balance equation (4) is, 



f i l + s i-l p i-l = s i (a i + p i>* 

But under the assumption on external flows. 



f ii + s.. 1 p i _ 1 = i. , 
so the balance equation can be written, 



But, 



r i = s i a i + s i p i 



S.p. = S*p*Y.a. 
1*1 1 * 1 1 1 1 



= °i' 



and , 



Thus , 



s . a . = s . y • a . 
ii i'll 



h - s i Y i a i + °i • 
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Rearranging terms we have, 



s i = |z i - °i>^i a i • 



The total flow out of grade i is 

0 . = s . p . 

1 1*1 



* * 

= s . p . y • ot . , 

l ' l i 



so, 



°i 



s*y . a . 
l 1 1 l 



Y . a . 0 . 

= ( 1 



Y • ot • I • - 0 . 

'll l l 



) . 



One may verify that substituting the above expressions for 
* * 

s^ and p^ into the balance equation reduces it to an 



identity. 



□ 



Lemma 2. For any point (1^, 0^) in the flow plane for 
grade i the vector of retraining flows r. = (r r . v ) 

1 1 i. IK 

is well defined, and the retraining flow for group k in 
grade i, is given by 

Y ik a ik Y ik a ik. 



Y ik a jk. 



r ik (I i' °i> = 9ik " X i " °i ( 



1 Y i a i - Y i a i 



-) 



Proof. From Section G, equation (14) gives r^ as a function 



r- * 

of s i , 



, * s Y ik a ik x , * , Y ik a ik Y ik a ik x 

r., (s.) = g.l( ) + s-Y-a-( ) • 

ik i' y i r Y i ot i i'i i. Y^i Y i a i 



From Lemma 1 we have 



s.Y.a. = I. - 0. . 

ill l l 
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Thus , 



/t , T /^ik ^ik a ik s , /T n w 'ik J ik ^ik a ik s 

r uc (I i' °i> = h<— - TZ" + (I i - °i> <— T^ - ’ 

1 'l 1 'l 1 'l 1 



= g 



ik 



I ( Zi ^ ilS ) _ o ( ^kk q ik _ ^ik a ik ^ 



i y.a. 
'i i 



i 'y.a. 
'i i 



y.a. 
'i i 



We must establish that g., is a well defined function of I., 

^ik l 

Let e^ be a K x 1 vector having all zero components except 
for a 1 in its k*"* 1 component. Let 
a. 



g^ = f^, when i=l , 

, when i-2, 



Y i-i 0 a i-i 



, n . 



9, 



Y • i a • i 
* 1-1 1-1 

From the introductory discussion the distribution vector 
is known. 









Then g. = (g.l)g. = I.g. , 
and g. k = g.e k 

= (9 i l)9 i e k . 



T ^ 

= I . g . e, . 
i^i k 



□ 



Lemmas 1 and 2 demonstrate that total stock s^ , overall 
promotion rate p? and the retraining flows r^ are uniquely 
determined by the total flows 1^ and (k. It follows that 
the optimal retraining cost in grade i is also uniquely 
determined by these flows. 

The following notation is useful in the proof of theorem 
3 and in subsequent analysis: 

1) 



^ % 
g ik = g ^ e 



i k 



r\j 

where g^ and e^ are as defined in the proof of Lemma 2, 
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2) b ik = 9 



‘ ik 



Y ik a ik 

Yi a i 



3) e ik = Y • a 



Y ik a ik Y ik a ik 



1 1 



Y . a . 

1 i i 



We 



\f 

note that g^, b.^ and are scalars; their values are 



known and they do not depend on 1^ or 0^. 
We then have 



Y ik a ik 



Y ik a ik Y ik a ik 



r lk (I i'°i ) = I i (g ik “ T^a~ + 0 i ( ~ 



i l 



’i a i 



y.a. 

'i i 



= b..I. + B-, 0. 

ik l ik l 



(19) 



Note that the retraining flows in grade i are linear functions 
of the total flows into and out of grade i. 

For any point (1^, 0^) in the flow plane (excluding the 
origin) define the ray through (1^, (K) as R(I^, (K) , i.e., 

R ( I i , 0.) = { (x,y) : for some c>0, x=cl^, y = cCK}. 

Theorem 3. For any point (1^, CK) in the flow plane for 
grade i (excluding the origin) , 

1) the dual optimal solution to the retraining transportation 
problem, v?, is the same for all points in R(I^,CK), 

2) the value of the optimal retraining cost for any point 
in R(I i ,0 i ) is 

d?(cl., cO.) = cd? (I . , 0.), (c>0), 

l l l i i l 

3) the value of p? is constant in R(I^,0^). 

Proof. We begin by writing the dual retraining transportation 
problem with the retraining flows as functions of the total 
flows into and out of grade i. 
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K 

DRTP) max d (I , 0 ) = J r. k d k , O k ) 

k=l 

ST v.. - v. <c.(k,m); kckt, m K, 

lk lm - 1 ' 1 ' k 

V • , / V. UNRESTRICTED, 
lk lm 

From Lemma 2 and equation (19) , 

r ik (I i' °i> “ b ik r i + B ik °i- 
The objective function in DRTP may then be written. 



■W °i> = J^ikh + 6 ik °i> 



‘ v ik 



Let be a solution to DRTP at (1^, (K) , i.e., 

°i> = X lb ik h + B ik °i )v ik- 

At any other point in R(I^, 0^) the objective function in 
DRTP is, 



d i (cI i' c0 i» = J^ik cl i + B ik c0 i» 



i' v ik 



= c J. (b ik h + B ik °i )v ik 
k=l 



= cd i (I i , 0 i ) . 

It is well-known that multiplying the objective function of 
a linear program by a positive constant c, 

1) does not change the value of the variables in the 
optimal solution, 

2) changes the optimal value of the objective function 
by the factor c. 
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Thus we have proven parts 1) and 2) of Theorem 3. 



To prove part 3) we note that from Lemma 1 the value of 



p? at any point in R(I^, CK) is, 

-) , where c>0, 



* - , Y i a i, , 

p i y . a 



c0 i 



i i cl^ - cO^ 



Y • a . 

= (rr—) ( 



0. 

l 



Y • a . ' 'I . - 0 . 

'll l l 



) , 



i.e., the value of p^ is constant in R(I^, 0^) . 



□ 



The flow plane for grade i is sketched in Figure 3. 

The constant-value curves for s? and p? are depicted by dashed 
and solid lines respectively. Note that the ray on which 
I i = ®i corresponds to p. = “ and s^ = 0. The ray on 
which 0^=0 corresponds to p* = 0. 

We see from Theorem 3 that the optimal retraining cost 
is linear on any ray in the flow plane. Next, we consider 
the character of the optimal retraining cost in the region 
bounded by two rays, i.e., a cone in the flow plane. 

Lemma 4. If v? is the dual optimal solution to the retraining 
transportation problem at two points in the flow plane, 

(I|, 0|) and (IV, 0 V ) , then v? is the dual optimal solution 
at any point (1^, 0^) such that for 0 < 0 < 1, 



(I ± , 0 i ) = 0(1! , 0J_) + (1-0) (IV, 0V) . 

Proof. If v? solves the dual retraining transportation 

problem (DRTP) at (I 1 , O') and (I' , O' ) , then, 

i i i i 

1) v? is a feasible solution to DRTP everywhere in the 
flow plane for grade i. 
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Figure 3. Constant-Value Curves for s? and in the 
Flow Plane for Grade i. 



2) ajapop = ! (b ik i: + e ik opv° k 

k=l 



_ K 

3) d°(I", 0") = l (b.. I" + 0”) v?. . 

i l ^=1 i ljv l 1K 



Let, 



K 



l i (I i' 0 A ; v°) = + B ik 0i )v° k 



( 20 ) 
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We must show for 0 1, and 

(I ± , 0 i ) = 0(1/ , 0/) + (1-6) (IV , 07 ), 

that d?(I if 0 i ) = d i (I i , 0 i? v?). 



( 21 ) 



By substituting (21) into (20) we find, 

K 

<3.(1., 0 i? v?) = l (b^tOlH- d-6) IV ] + 6^ [90. + (1-6) Oj* ])v° 
k=l 



= 0d°(l! , 0!) + (1-0) d? (I., 0V ). 

Because the dual optimal solution implies a basis for the 
primal optimal solution, it is readily verified that the sets 
kt and cannot change unless the dual optimal solution 
changes. 

Let solve the primal retraining transportation problem 

(PRTP ) at (I!, 0!), and let x!' solve PRTP at (I !' , 0!’). Then 

i i l i i 

because the optimal values of the primal and dual linear 
programs are equal, we have, 

l l c i (k,m)x| (k,m) = d?(Ij, 0|) , (22) 

kekT mekT 

i i 

£ l c. (k,m)x!* (k,m) =d?(I" 0!'). (23) 

kek . mek. 

i i 

x^(k,m) = 0x|(k,m) + (1-6) x7(k,m) , kekt, mek^ . 

By hypothesis (x|(k,m)} and (x7(k,m) } are feasible in 
PRTP. Thus 
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I x i (k,m) = l (Gx|(k,m) + (l-6)xj' (k,m) ) , kSkt 

meK meK. 

1 



6^ xMk,m) + (1-6) l x V(k,m) 



meK . 



meK . 

1 



er ik (I i- °i> + < 1 -e )r i k (I i- op 



Using equations (19) and (21) we have, 

7 x.(k,m) = r., (I-, 0.), keKt . 

l ik i ' i i 



meK . 



Similarly it can be verified that 

l . x . (k,m) = -r . . (I . , 0 .), meKT . 
keK + 1 ik i i l 

l 

Furthermore, x! (k,m) > 0, x"(k,m) > 0 and 0 < 6 < 1 implies 

l “ i - - - 

that 



0x! (k,m) + (1-6) x|'(k,m) > 0 , 

i l 

i.e., x^(k,m) > 0. We note that this is the only part of 
the proof that uses the condition 0 < 0 < 1. 

Thus we have shown that x^(k,m) is feasible in PRTP. 
Using (20) , (22) and (23) we find that 



l, l c. (k,m)x. (k,m) = 6d?(i; , 0|) + (l-0)d?(I" 0 !' ) 

keK + meK" 1 1 111 1 1 1 



=d.(I., 0., v.). 



So we have found a primal feasible solution (x^(k,m)} and a 
dual feasible solution v° and both give the same value of 
the objective function, so they are optimal solutions. In 
particular, v° is the dual optimal solution at (1^, 0^) as 
was to be shown. □ 
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If solution of the retraining transportation at two 
points in the flow plane results in the same dual optimal 
solution v? , then the proof of Lemma 4 enables us to 
determine the extent of the cone in the flow plane in which 
the dual optimal solution does not change. 

Corollary 5. If two points in the flow plane, (I|, 0|) 
and (IV, 0 V ) , have solutions to PRTP, x| and xV respectively, 
and have the same solution to DRTP, v°, then for any 0 such 
that 

9x| + (l-0)xV > 0, 
the solutions to PRTP and DRTP at 

(I., 0 i ) = 9(1!, 0!) + (1-0) (IV , 0".) 

are, 

0x! + ( 1- 0 ) x V , 

and, 

0 

v . 

1 

respectively . 

Proof. Follows from the proof of Lemma 5. □ 

The foregoing results enable us to describe the optimal 
retraining costs as a function of the total flows into and 
out of grade i. 

Theorem 6. The optimal retraining cost in grade i is a 
piecewise linear convex function of the total flows into and 
out of grade i. 

Proof. The piecewise linearity of th optimal retraining 
costs follows from equation (19), Lemma 4 and Corollary 5. 
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To show the convexity of the optimal costs, it will 
be convenient to use the original retraining cost minimization 
program developed in Section F. 



K K 

PI] min l l c. (k,m) x . (k,m) 
k=l m=l 1 1 

K 

ST l (x . (k,m) - x . (m, k) ) = r.. , k=l,***,K 

m=l 

x^ (k,m) > 0 . 

I II 

Suppose x^ solves PI at ( 1^, 0^) , and x^ solves PI at 
(1^, 0[). Then, 

K K 

d?(l! O'.) = l l c. (k,m)x. (k,m) , (25) 

111 k=l m=l 1 1 

and 



d? (I . , 0 .) = l l c. (k,m)x. (k,m) . (26) 

11 k=l m=l 1 1 



Let, 

I n 

x. = 0x . + (1-6) x . 
l i i 

Then x. >0, and, 

K K i i it 

Y (x. (k,m)-x. (m,k) ) = £ 16 (x. (k,m)-x. (m,k) + (1-6) (x. (k,m) 
m=l rn=l 



x i (m,k) ) ] 



K , K „ ,i 

= 0 l (x. (k,m)— x. (m,k) ) + (1-6) l (x. (k,m)-x. (m,k) ) 
m=l 1 1 m=l 



11 II 



- 6r ik (I i' °i» + °i> 






So x^ is a feasible solution to PI at (1^, 0^) . The value 
of the objective function with this solution is. 
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K K K K , „ 

l l c, (k,m)x. (k,m) = l l c, (k,m) [0x, (k,m) + (l-0)x . (k,m) ] 

k=l m=l k=l m=l 



= 0 d?(l!!, o[) + (l- 0 )d?(I^, 0 [) 

from (25) and (26) . 

But the minimal value of the objective function can be 
no greater than the value resulting from the feasible 
solution x^, i.e., 

d?(I i , 0 i ) < 0d?(I^, o’) + (l- 0 )d°(L, 0^). □ 

We use the assumption that 0^ = I^_^, i=l,***/n-l to 
combine the flow planes. 

Let us define an (n+1) -dimensional "flow space" as the 

nonnegative orthant of E n+ \ and let the i t ^ 1 component of any 

point in this space be the total flow into grade i, i=l,***,n, 
s t 

and the (n+1) component is identically zero. Let, 

n 



d°(I. 



•'W 



J/PV W' 



(27) 



where 



n+1 



0 . 



Theorem 7. The function d° (1^ , * * * , I n+ ^) is convex. It is 
linear on cones in which the optimal dual solutions do not 
change . 



Proof. The sum of convex functions is a convex function, so 
Theorem 6 and the definition of d° (1^ , • • • , I n+ ^ ) indicate it 
is convex. 

We have from Corollary 6 that in each grade i, the 
optimal retraining cost d?(I^, I i + i^ i s linear in any region 
in which the dual optimal solution v? does not change. 



141 



Theorem 3 and Lemma 4 indicate that such regions are two- 
dimensional cones. The intersection of such two-dimensional 
cones in the (n+1 ) -dimensional flov; space defines a cone in 
E n+ 1. The sum of linear functions is linear, so the latter 
part of the theorem follows. c 

It has been shown in this section that under the 
assumption that external flows are restricted to the lowest 
grade, the retraining costs are a convex function of the 
total flows of people between grades. Consequently, in a 
cost minimizing scheme such as that described in Section H, 
any local optimum is also a global optimum provided the set 
of solutions considered feasible is a convex set in the flow 
space . 
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APPENDIX A. SUMMARY OF NOTATION 



For the most part scalars and vectors are denoted by 
lower case letters, and matrices are denoted by upper case 
letters. There are exceptions. 

The subscript i is used to denote the first character- 
istic, grade. The subscript j is used to denote the second 
characteristic . 

A matrix of zeroes is denoted by 0. A vector of zeroes 
is denoted by 0. A vector of ones is denoted by 1. The 
dimensions of 0, 0 and I are implied by the context in which 
they are used. For any vector x we use xl to denote the sum 
of the components of x. 

Unless otherwise indicated the empty sum and the empty 
product are defined as identity elements for the correspond- 
ing operator. For example,, if is a matrix then, 
i-! _ 

][ B m = 0' the additive identity matrix, when k> 1, 
m=k 

i-1 

II B =1, the multiplicative identity matrix, when k>i. 

k m 

When a matrix is displayed in terms of its submatrices, 
the zero submatrices may not be shown explicitly. For 
example , 



* 






Q 1 P 1 




°i p i ° 


Q 2 P 2 


II 


Oil 

O 

to 

to 






= = 


Q 3 




0 0 0 3 


J 
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For any square matrix A we use A^ to denote the matrix 

which has the same elements as A on its main diagonal but 

has zeroes elsewhere. For any matrix B we use B to denote 

J sq 

the matrix which has the square of the elements of B as 
its elements. 

The matrix P^ and the vector s^(t) are defined for 
i=l,*‘*,n. For convenience we may use the convention P^=0 
and s^(t) = 0 for i<l. 

If x and y are two vectors such that 
x = (x 1 .---,x m ) 

and, 

y - ( yi . • - • ,y m ) . 

then we define the congruent multiplication operator, 
by, 

x0y = < x 1 y 1 ,---,x m y m ). 

Congruent multiplication is discussed in some detail in 
R. A. Howard, 1970. 

A "distribution vector" is a nonnegative vector having 
components that sum to one. Row vectors are enclosed in 
parentheses. Column vectors are enclosed in brackets. 

The symbol Q denotes the end of a proof. 
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APPENDIX B. DIAGONAL MATRICES 



Let A be a matrix, not necessarily square, and let 
A (k,j) be the element of A in row i and column j. We say 
that A is a diagonal matrix if for some integer k , A (k,j) 

/i 

^ 0 implies that j-i equals k We call k^ the index of the 
diagonal matrix A. 

The key cell of the diagonal matrix A is defined as: 



(1) 


A 


(1,1) if k A = 0, 








(2) 


A 


d,k A + l) if k A > o. 








(3) 


A 


(l-k A , l) if k A < 0. 








will 


use 


the notation (i A , j A ) 


for 


the row and column 


of 


key 


cell of A. Note that j_ 

J A 


" a a 


= k A 




If A 


is 


a zero matrix, then i 


t is 


a diagonal matrix. 


and 



the value of its index is arbitrary. If A is a non-zero 
diagonal matrix, then its index is unique and its key cell 
is well defined. 

Let A be a non-zero mxn diagonal matrix, and let Z A be 
maximum number of non-zero elements in A. Then it is readily 
verified that 

z A = min (m + min (0, K A ), n - max {0, k A )}. 

All non-zero elements of A are in the set: 

{A(i A + r, j A + r) ; r = 0,l, ## *,z A - l}. 

This set is called the non-zero diagonal of A. 

The purpose of this appendix is to show that diagonal 
matrices can be stored and multiplied in compacted form. 
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If A is an m x n diagonal matrix, then the compacted form 

* 

of A is a vector A v/here, 

A* (1) = m, 

A* (2) = n, 

A* (3) = k A , 

A* (4) = z A , 

* 

and the fifth through last elements of A are the elements of 

the non-zero diagonal of A. 

A zero matrix A may be stored as a four-component 
* 

vector A : 



* 



A 


(1) 




m 


* 


A 

* 


(2) 




n 

* 


A 


(3) 


> 


n or A (3) £ - m 


* 


A 


(4) 


< 


0 . 



We will show that the product of diagonal matrices is 
a diagonal matrix, and that the product is efficiently 
computed using the compacted forms of the multiplicand and 
product matrices. 

Lemma 1 . If A and B are diagonal matrices such that C = AB 
is defined, then C is a diagonal matrix. Furthermore the 
index of C is the sum of the indices of A and B. 

Proof . If C is a zero matrix, then it is a diagonal matrix 
and the value of its index is arbitrary. 

If C is not a zero matrix, then for some row i and column 

j» 

C (i, j) f- 0. 
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row of 



We note that C(i, j) is the inner product of the i fc ^ 

A and the j*"* 1 column of B, and that each of the rows and 

columns of A and B contain at most one non-zero element. 

Consequently, there must exist nonnegative integers r and 

r n such that: 

B 



C ( i , j) = 



A(l A + r A' ^A 



r A> B(i B + r B' j B + r B> 



1 = 



i. + r, 
A A 



J = 



J B + r B 



3 A r A 



X B + r B ' 



Starting with the last equation 
r B = 3 a + r A - *B 



j = j B + j A + r A " *B 



j-l - j„ + 3 A + i B ‘ 



^A + “ ^B^ 



= k A + V 



□ 



Continuing with notation used above, we see that each 
element of the non-zero diagonal of C is the product of 
elements of the non-zero diagonals of A and B. We consider 
next how the elements of the non-zero diagonals of A and B 
are paired to form the non-zero diagonal of C. 

From the proof of the lemma, if C (i,j) j- 0, then for 
some non-negative integers r A and r 0 : 

(1) C(i,j) = A(i A + r A , j A + r A ) B(i B + r B , j 0 + r 0 ) 
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(3) 3 - 3 B + r B 



( 4 ) 



3 fl + r fl “ *6 + r B 



To find the relation between and we note that 
<5 b + r B> - (i A + r A> " 5 - 1 

= k A + k B' 



SO , 



r B - r A = k A + k B - ’B + d A 



* k B 3 B* + + k A* 



But, 



i- + k* = 1 + k A 
A A A 



if k, > 0 
A 



= 1 



and, 



if k A < 0, 






B 



it k B ; ° 



‘ k B - 1 



Thus , 



if k fi < 0. 



r„ - r. = k_ + k A 
BABA 



where k. = k. if k > 0 
A A A 



= 0 otherwise. 



k B = k B if k B < 0 



= 0 otherwise. 



L6t d AB = k B + k A • 

1) If k B + k+ = a AB >0, then r B = + d AB > 0, and 

the first element of the non-zero diagonal of A is paired with 
s t 

the (d AB + 1) element of the non-zero diagonal of B. 
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2) If k g + k* = d AB < 0, then r B = r B - d Afi > 0, and the 
first element of the non-zero diagonal of B is paired with 

i. i- 

the (1-d )*" element of the non-zero diagonal of A. 

Consider the case 




Then , 



r B r A d AB + d AB d AB ' 



and, 



C(i,j) A(i a d AB , 3A“ d AB )B(l B +d AB'^B +d AB ) 



1 = A A " d AB 



^ ^B + d AB 



If k, > 0 or k fi < 0, then, 



A 

1 ) 



2 ) 



d AB " 0 * k B < 0 
^B " 1 



j - 1 



d AB > 0 " k A > 0 

" *A = 1 
-*■ i =1 



31 d AB * 0 " k A “ k B = 0 
* A A = j B = 1 

+ i - j =1 

So if k A > 0 or k fi < 0, then either i or j equals 1 in the 
equation 

C(i,j) = A(i A -a; B , V d AB> B(i B +d AB' V d AB> ’ 
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That is, C(i,j) is the key element of C. 

If K < 0 and k > 0, then d,„ = 0 and 
A B AB 

C ( i , j ) = C(i A ,j B ) = C(l-k A , l+k B ), 

and the first min {-k^, k fi } elements of the non-zero diagonal 
of C are zeroes. 

The foregoing observations lead to the following algorithm 
for multiplying two diagonal matrices. 

1) Compute d^„ = k„ + k* . 

AB B A 

2) Drop the first d^ B elements of the non-zero diagonal 
of A and store the remaining elements in a vector VA. 

3) Drop the first d+ elements of the non-zero diagonal 
of B and store the remaining elements in a vector VB. 

4) Drop elements from the end of the longer of the two 
vectors VA and VB until they are of equal length. Multiply 
corresponding components of these vectors to form a new vector 
VC. 

5) Append max {0, min {-k A ,kg}} zeroes in front of VC. 
Compute z c , the length of the non-zero diagonal of C, and 
append zeroes to the end of VC as necessary to form the non- 
zero diagonal of C. 

6) The first four elements of the compacted form of C 
are A(l), B(2), k + k n , z . The remaining elements are the 
non-zero diagonal of C as computed in 5) . 
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